CodeGoat24 committed f14142a (verified) · 1 parent: c3eeba3

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -26,7 +26,7 @@ Welcome to try the latest version, and the inference code is available at [`here
 
  ## Model Summary
 
- `UnifiedReward-2.0-qwen-7b` is the first unified reward model based on [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-32B-Instruct) for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
+ `UnifiedReward-2.0-qwen-7b` is the first unified reward model based on [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
 
  For further details, please refer to the following resources:
  - 📰 Paper: https://arxiv.org/pdf/2503.05236