liumy2010
/

Llama-3.2-3B-countdown-R3

Text Generation

text-generation-inference

Model card Files Files and versions

liumy2010 commited on May 29

Commit

fcc4b2c

·

verified ·

1 Parent(s): 27eced9

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +4 -0

README.md ADDED Viewed

	@@ -0,0 +1,4 @@


1	+ ## References
2	+
3	+ * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)
4	+