CodeGoat24
/

FLUX.1-dev-PrefGRPO

Model card Files Files and versions

CodeGoat24 commited on 25 days ago

Commit

8ac1768

·

verified ·

1 Parent(s): 6d3c106

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ This model is trained using [Pref-GRPO](https://codegoat24.github.io/UnifiedRewa
 For further details, please refer to the following resources:
-- 📰 Paper:
 - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/Pref-GRPO
 - 🤗 UniGenBench: https://github.com/CodeGoat24/UniGenBench
 - 🤗 Leaderboard: https://huggingface.co/spaces/CodeGoat24/UniGenBench_Leaderboard
@@ -51,5 +51,10 @@ image.save("flux-dev.png")
 ## Citation
 ```
 ```

 For further details, please refer to the following resources:
+- 📰 Paper: https://arxiv.org/pdf/2508.20751
 - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/Pref-GRPO
 - 🤗 UniGenBench: https://github.com/CodeGoat24/UniGenBench
 - 🤗 Leaderboard: https://huggingface.co/spaces/CodeGoat24/UniGenBench_Leaderboard
 ## Citation
 ```
+@article{Pref-GRPO&UniGenBench,
+  title={Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning.},
+  author={Wang, Yibin and Li, Zhimin and Zang, Yuhang and Zhou, Yujie and Bu, Jiazi and Wang, Chunyu and Lu, Qinglin, and Jin, Cheng and Wang, Jiaqi},
+  journal={arXiv preprint arXiv:2508.20751},
+  year={2025}
+}
 ```