Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Holarissun
/
RM-HH-GPT2-4w_helpful_gpt3_loraR64_40000_gpt2-large_shuffleTrue_extractchosenFalse

PEFT
Safetensors
trl
reward-trainer
Generated from Trainer
Model card Files Files and versions Community
RM-HH-GPT2-4w_helpful_gpt3_loraR64_40000_gpt2-large_shuffleTrue_extractchosenFalse
1.52 kB
  • 1 contributor
History: 1 commit
Holarissun's picture
Holarissun
initial commit
e29716c verified over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago