File size: 116 Bytes
fcc4b2c
 
 
 
1
2
3
4
## References

    * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)