Breaking the Data Barrier -- Building GUI Agents Through Task Generalization Paper • 2504.10127 • Published Apr 14, 2025 • 17
A2Eval: Agentic and Automated Evaluation for Embodied Brain Paper • 2602.01640 • Published 30 days ago • 8
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 23 days ago • 41
P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads Paper • 2602.09443 • Published 22 days ago • 57
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Paper • 2602.02477 • Published 29 days ago • 10
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 23 days ago • 41
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 23 days ago • 41
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 30 days ago • 32
A2Eval: Agentic and Automated Evaluation for Embodied Brain Paper • 2602.01640 • Published 30 days ago • 8
A2Eval: Agentic and Automated Evaluation for Embodied Brain Paper • 2602.01640 • Published 30 days ago • 8
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models Paper • 2511.23319 • Published Nov 28, 2025 • 24
P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published Nov 17, 2025 • 134
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Paper • 2503.16057 • Published Mar 20, 2025 • 14
Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting Paper • 2310.08129 • Published Oct 12, 2023
QUBE: Enhancing Automatic Heuristic Design via Quality-Uncertainty Balanced Evolution Paper • 2412.20694 • Published Dec 30, 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models Paper • 2401.13919 • Published Jan 25, 2024 • 32