Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
33
152
43
KABI
dongguanting
Follow
varuy322's profile picture
ankits0052's profile picture
ChazzyGordon's profile picture
47 followers
·
85 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
6 days ago
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
upvoted
a
paper
6 days ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
upvoted
a
paper
6 days ago
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
View all activity
Organizations
dongguanting
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
10 days ago
meituan-longcat/LongCat-Flash-Chat
Text Generation
•
562B
•
Updated
1 day ago
•
32.8k
•
433
liked
a dataset
12 days ago
inclusionAI/ASearcher-train-data
Preview
•
Updated
27 days ago
•
703
•
12
liked
2 datasets
25 days ago
We-Math/We-Math2.0-Pro
Viewer
•
Updated
21 days ago
•
4.55k
•
1.02k
•
18
We-Math/We-Math2.0-Standard
Viewer
•
Updated
21 days ago
•
5.84k
•
1.09k
•
19
liked
a model
28 days ago
Kwai-Klear/Klear-Reasoner-8B
8B
•
Updated
12 days ago
•
132
•
13
liked
a model
about 1 month ago
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28
•
18
•
3
liked
3 datasets
about 2 months ago
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
28 days ago
•
54.6k
•
511
•
9
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Jul 29
•
1.07k
•
169
•
4
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
28 days ago
•
10k
•
227
•
3
liked
5 models
about 2 months ago
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
28 days ago
•
17
•
1
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
28 days ago
•
64
•
4
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
21 days ago
•
64
•
2
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29
•
44
•
1
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
28 days ago
•
24
•
1
liked
3 models
2 months ago
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
Jun 6
•
44
•
2
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6
•
11
•
1
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30
•
20
•
2
liked
a dataset
2 months ago
basicv8vc/SimpleQA
Viewer
•
Updated
Nov 5, 2024
•
4.33k
•
4.05k
•
23
liked
a dataset
3 months ago
dongguanting/Tool-Star-SFT-54K
Viewer
•
Updated
May 29
•
54k
•
290
•
8
liked
a dataset
4 months ago
dongguanting/Multi-Tool-RL-10K
Viewer
•
Updated
May 25
•
10k
•
142
•
4
Load more