Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
liumy2010
/
Llama-3.2-3B-countdown-R3
like
1
Text Generation
Transformers
Safetensors
llama
text-generation-inference
arxiv:
2505.16984
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
liumy2010
commited on
May 29
Commit
fcc4b2c
·
verified
·
1 Parent(s):
27eced9
Upload README.md with huggingface_hub
Browse files
Files changed (1)
hide
show
README.md
+4
-0
README.md
ADDED
Viewed
@@ -0,0 +1,4 @@
1
+
## References
2
+
3
+
* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)
4
+