|
--- |
|
library_name: transformers |
|
pipeline_tag: text-generation |
|
license: apache-2.0 |
|
language: |
|
- en |
|
base_model: |
|
- miromind-ai/MiroThinker-32B-SFT-v0.2 |
|
tags: |
|
- agent |
|
- open-source |
|
- miromind |
|
--- |
|
|
|
<div align="center"> |
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/68525b342230a897a65cc1c0/87mYQ_a-4jpnMkVR4hrgm.png" width="55%" alt="MiroThinker" /> |
|
</div> |
|
<!-- <hr> --> |
|
<div align="center"> |
|
|
|
[](https://dr.miromind.ai/) |
|
[](https://huggingface.co/collections/miromind-ai/mirothinker-v02-68af084a18035f57b17cd902) |
|
[](https://huggingface.co/datasets/miromind-ai/MiroVerse-v0.1) |
|
[](https://miromind.ai/blog/miromind-research-agent) |
|
|
|
[](https://github.com/MiroMindAI/MiroThinker) |
|
[](https://discord.com/invite/GPqEnkzQZd) |
|
[](https://huggingface.co/datasets/miromind-ai/MiroFlow-Benchmarks/resolve/main/assets/wechat.png) |
|
[](https://www.xiaohongshu.com/user/profile/5e353bd80000000001000239) |
|
[](https://miromind.ai/) |
|
|
|
</div> |
|
|
|
## Introduction |
|
|
|
MiroThinker is an open-source agentic model series. Designed as a research agent for complex, long-horizon problem solving, it integrates strong capabilities in task decomposition, multi-hop reasoning, retrieval-augmented generation, code execution, web browsing, and document/file processing, enabling a wide range of real-world applications. |
|
|
|
In MiroThinker-v0.2, we introduced three key improvements: |
|
|
|
- **Richer training data** from both English and Chinese sources, yielding significant gains in benchmark performance and generalization. |
|
- **Unified DPO training** with a single preference dataset across all models. |
|
- **Extended context length** from 40k to 64k for more challenging multi-turn tool-use tasks. |
|
|
|
Compared to v0.1, MiroThinker-v0.2 delivers consistent gains across benchmarks. For example, scores improved from **57.3 → 64.1** on **GAIA-Text-103** and from **17.0 → 29.4** on **BrowseComp-ZH**, reflecting substantial advancements in the model’s general research agent capabilities. |
|
|
|
<div> |
|
<img src="https://huggingface.co/datasets/miromind-ai/MiroFlow-Benchmarks/resolve/main/assets/MiroThinker_v0.2_Performance_2.png" width="100%" alt="MiroThinker" /> |
|
</div> |
|
|
|
## Online Demo |
|
|
|
Welcome to try out our online demo [here](https://dr.miromind.ai/). |
|
|
|
## Performance |
|
|
|
> [!IMPORTANT] |
|
> <div> |
|
> To prevent data leakage during searches, we block Hugging Face domains to ensure the model doesn't access answers through shortcuts. |
|
> </div> |
|
|
|
### Comparison with SOTA Research Agents |
|
|
|
<div> |
|
<img src="https://huggingface.co/datasets/miromind-ai/MiroFlow-Benchmarks/resolve/main/assets/MiroThinker_v0.2_Performance_0.png" width="100%" alt="MiroThinker" /> |
|
</div> |
|
|
|
### GAIA Benchmark |
|
|
|
<div> |
|
<img src="https://huggingface.co/datasets/miromind-ai/MiroFlow-Benchmarks/resolve/main/assets/MiroThinker_v0.2_Performance_1.png" width="100%" alt="MiroThinker" /> |
|
</div> |
|
|
|
## Quick Start |
|
|
|
MiroThinker-v0.2 is trained on our large-scale, high-quality trajectory and preference datasets MiroVerse-v0.2, utilizing the efficient training framework [MiroTrain](https://github.com/MiroMindAI/MiroTrain), and enhanced with tool-use capabilities through our agentic framework [MiroFlow](https://github.com/MiroMindAI/MiroFlow). |
|
|
|
To promote reproducibility and benefit the community, we decided to open-source the entire suite mentioned above. For more technical details, evaluation results, and usage tutorials, please visit our [GitHub repository](https://github.com/MiroMindAI/MiroThinker). |
|
|
|
## License |
|
|
|
MiroThinker-v0.2 is licensed under Apache 2.0. |
|
|
|
## Contact Us |
|
|
|
MiroThinker is developed by the MiroMind Foundation Model Team. |
|
If you would like to leave us a message, feel free to get in touch. |
|
In addition to [GitHub](https://github.com/MiroMindAI/), |
|
[Discord](https://discord.com/invite/GPqEnkzQZd), |
|
[WeChat](https://huggingface.co/datasets/miromind-ai/MiroFlow-Benchmarks/resolve/main/assets/wechat.png), |
|
and [RedNote](https://www.xiaohongshu.com/user/profile/5e353bd80000000001000239), |
|
you can also reach us via email at [email protected]. |