liumy2010/Llama-3.2-3B-countdown-R3
Tags: Text Generation, Transformers, Safetensors, llama, text-generation-inference, arxiv:2505.16984
References
* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)