Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
SoraWatermarkRemover
Log In
Sign Up
jcmei
/
mistral-7b-instruct-sppo-iter1
like
0
Safetensors
synthetic_data_llama-3-8b-instruct-dpo-iter1_score
llama
alignment-handbook
Generated from Trainer
trl
sppo
License:
llama3
Model card
Files
Files and versions
xet
Community
428cea7
mistral-7b-instruct-sppo-iter1
1.52 kB
1 contributor
History:
1 commit
jcmei
initial commit
428cea7
verified
11 months ago
.gitattributes
Safe
1.52 kB
initial commit
11 months ago