---
library_name: transformers
pipeline_tag: text-generation
base_model:
- meta-llama/Llama-3.2-3B
---

## UFT

This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).

Code: https://github.com/liumy2010/UFT

## References

* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)