liumy2010's picture
Upload README.md with huggingface_hub
fcc4b2c verified
|
raw
history blame
116 Bytes

References

* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)