2 36 5

Yuxin Zuo

yuxinzuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Towards a Unified View of Large Language Model Post-Training

upvoted a paper 19 days ago

Intern-S1: A Scientific Multimodal Foundation Model

upvoted a paper 19 days ago

From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published 5 days ago • 60

upvoted 2 papers 19 days ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published 19 days ago • 245

From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery

Paper • 2508.14111 • Published 23 days ago • 33

upvoted a paper 22 days ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 26 days ago • 91

upvoted a paper 28 days ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7 • 21

upvoted 2 papers about 1 month ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 125

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 81

liked a model about 2 months ago

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated 12 days ago • 64k • 241

liked a dataset about 2 months ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 6.54k • 158

liked a dataset 3 months ago

ChuGyouk/MedXpertQA

Viewer • Updated Jun 15 • 4.45k • 155 • 5

upvoted a collection 3 months ago

SimpleVLA-RL

Collection

6 items • Updated Jun 15 • 2

authored a paper 3 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 130

upvoted 2 papers 3 months ago

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28 • 46

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 130

upvoted a paper 4 months ago

Learning to Reason without External Rewards

Paper • 2505.19590 • Published May 26 • 29

liked a model 4 months ago

Intelligent-Internet/II-Medical-8B

Text Generation • 8B • Updated 29 days ago • 37.3k • • 172

upvoted 3 papers 4 months ago

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Paper • 2505.03981 • Published May 6 • 15

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 97

commented a paper 5 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120 •

Yuxin Zuo

AI & ML interests

Recent Activity

Organizations

yuxinzuo's activity