Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • VoxCPM

  • Log In
  • Sign Up

jcmei
/
llama-3-8b-instruct-dpo-iter2

Safetensors
llama
alignment-handbook
Generated from Trainer
trl
sppo
Model card Files Files and versions Community
llama-3-8b-instruct-dpo-iter2
1.52 kB
  • 1 contributor
History: 1 commit
jcmei's picture
jcmei
initial commit
07671e6 verified 10 months ago
  • .gitattributes
    1.52 kB
    initial commit 10 months ago