Hugging Face
KABI
dongguanting
47 followers · 85 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted a paper 6 days ago: UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
upvoted a paper 6 days ago: R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
upvoted a paper 6 days ago: A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Organizations
dongguanting's datasets (11)
dongguanting/ARPO-SFT-54K • Viewer • Updated 28 days ago • 54.6k • 511 • 9
dongguanting/ARPO-RL-Reasoning-10K • Viewer • Updated 28 days ago • 10k • 227 • 3
dongguanting/ARPO-RL-DeepSearch-1K • Viewer • Updated Jul 29 • 1.07k • 169 • 4
dongguanting/RAG-Error-Critic-100K • Viewer • Updated Jun 28 • 100k • 20 • 2
dongguanting/Tool-Star-SFT-54K • Viewer • Updated May 29 • 54k • 290 • 8
dongguanting/Multi-Tool-RL-10K • Viewer • Updated May 25 • 10k • 142 • 4
dongguanting/RAG-QA-40K • Viewer • Updated Dec 27, 2024 • 32.8k • 18 • 2
dongguanting/ShareGPT-12K • Viewer • Updated Dec 27, 2024 • 12.9k • 28 • 1
dongguanting/VIF-RAG-QA-110K • Viewer • Updated Dec 27, 2024 • 111k • 48 • 7
dongguanting/DotamathQA • Viewer • Updated Dec 26, 2024 • 574k • 58 • 2
dongguanting/VIF-RAG-QA-20K • Viewer • Updated Nov 1, 2024 • 20k • 5 • 4