CodeGoat24/UnifiedReward-2.0-qwen-7b


CodeGoat24 committed 7 days ago

Commit f14142a · verified · 1 Parent(s): c3eeba3

Update README.md

Files changed (1)

README.md +1 -1

@@ -26,7 +26,7 @@ Welcome to try the latest version, and the inference code is available at [`here
 ## Model Summary
-`UnifiedReward-2.0-qwen-7b` is the first unified reward model based on [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-32B-Instruct) for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
+`UnifiedReward-2.0-qwen-7b` is the first unified reward model based on [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
 For further details, please refer to the following resources:
 - 📰 Paper: https://arxiv.org/pdf/2503.05236