Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ library_name: transformers
|
|
15 |
|
16 |
**VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
|
17 |
|
18 |
-
It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated training
|
19 |
|
20 |
For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
|
21 |
|
|
|
15 |
|
16 |
**VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
|
17 |
|
18 |
+
It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated **training data**: [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).
|
19 |
|
20 |
For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
|
21 |
|