Image-Text-to-Text
Transformers
Safetensors
feature-extraction
conversational
custom_code
Yin-Xie commited on
Commit
8eb300d
·
verified ·
1 Parent(s): 193cd55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ base_model:
17
  A family of fully open-source large multimodal models demonstrating **superior performance** across multiple multimodal benchmarks, **outperforming Qwen2.5-VL** in most evaluation tasks.
18
 
19
  2. **High-Quality Data at Scale**
20
- Meticulously curated **mid-training and SFT data** with rigorous filtering and quality control, achieving **superior data efficiency** with only **5B tokens** (1.2% of Qwen2.5-VL's training data).
21
  - Concept-balanced, highly diverse, high-quality caption data
22
  - Comprehensive instruction fine-tuning data covering a wide range of tasks
23
 
 
17
  A family of fully open-source large multimodal models demonstrating **superior performance** across multiple multimodal benchmarks, **outperforming Qwen2.5-VL** in most evaluation tasks.
18
 
19
  2. **High-Quality Data at Scale**
20
+ Meticulously curated **mid-training and SFT data** with rigorous filtering and quality control.
21
  - Concept-balanced, highly diverse, high-quality caption data
22
  - Comprehensive instruction fine-tuning data covering a wide range of tasks
23