Update README.md
Browse files
README.md
CHANGED
@@ -26,7 +26,7 @@ Welcome to try the latest version, and the inference code is available at [`here
|
|
26 |
|
27 |
## Model Summary
|
28 |
|
29 |
-
`UnifiedReward-2.0-qwen-7b` is the first unified reward model based on [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-
|
30 |
|
31 |
For further details, please refer to the following resources:
|
32 |
- 📰 Paper: https://arxiv.org/pdf/2503.05236
|
|
|
26 |
|
27 |
## Model Summary
|
28 |
|
29 |
+
`UnifiedReward-2.0-qwen-7b` is the first unified reward model based on [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
|
30 |
|
31 |
For further details, please refer to the following resources:
|
32 |
- 📰 Paper: https://arxiv.org/pdf/2503.05236
|