jcmei/llama-3-8b-instruct-dpo-iter2
Safetensors
synthetic_data_llama-3-8b-instruct-dpo-iter2_score
llama
alignment-handbook
Generated from Trainer
trl
sppo
07671e6
1.52 kB
1 contributor
History: 1 commit
jcmei, initial commit, 07671e6 (verified), 10 months ago
.gitattributes, Safe, 1.52 kB, initial commit, 10 months ago