Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • VoxCPM

  • Log In
  • Sign Up

jan-hq
/
AlphaMaze-v0.2-1.5B-GRPO-cp-600

Text Generation
Transformers
Safetensors
English
qwen2
text-generation-inference
unsloth
trl
conversational
Model card Files Files and versions
xet
Community
AlphaMaze-v0.2-1.5B-GRPO-cp-600
2.09 kB
  • 1 contributor
History: 2 commits
jan-hq's picture
jan-hq
Upload README.md with huggingface_hub
0eef29d verified 7 months ago
  • .gitattributes
    1.52 kB
    initial commit 7 months ago
  • README.md
    571 Bytes
    Upload README.md with huggingface_hub 7 months ago