Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • SoraWatermarkRemover

  • Log In
  • Sign Up

thorirhrafn
/
gpt1B_DPO_model_ver2

PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
Model card Files Files and versions
xet
Metrics Training metrics Community
gpt1B_DPO_model_ver2 / runs
45.9 kB
  • 1 contributor
History: 2 commits
thorirhrafn's picture
thorirhrafn
Training in progress, epoch 1
9297959 verified over 1 year ago
  • Apr23_09-12-26_gpu-2
    Training in progress, epoch 0 over 1 year ago
  • Apr23_09-19-01_gpu-8
    Training in progress, epoch 1 over 1 year ago