wave-on-discord
/

silly-v0.2

Text Generation

text-generation-inference

Model card Files Files and versions

wave-on-discord commited on 11 days ago

Commit

f5654ea

·

verified ·

1 Parent(s): 1e26da4

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -10,6 +10,7 @@ Finetune of [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Ne
 - 2 epochs of SFT on RP data, then about an hour of PPO on 8xH100 with [POLAR-7B RFT](https://github.com/RowitZou/POLAR_RFT)
 - Kind of wonky, if you're dealing with longer messages you may need to decrease your temperature
 - Reviews:
 > its typically good at writing, v good for 12b, coherent in RP, follows context and starts conversations well

 - 2 epochs of SFT on RP data, then about an hour of PPO on 8xH100 with [POLAR-7B RFT](https://github.com/RowitZou/POLAR_RFT)
 - Kind of wonky, if you're dealing with longer messages you may need to decrease your temperature
+- ChatML chat format
 - Reviews:
 > its typically good at writing, v good for 12b, coherent in RP, follows context and starts conversations well