Can Understanding and Generation Truly Benefit Together -- or Just Coexist? Paper • 2509.09666 • Published 8 days ago • 32
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model Paper • 2509.09372 • Published 8 days ago • 189
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published 9 days ago • 159
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published 10 days ago • 95
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published 9 days ago • 581
Towards a Unified View of Large Language Model Post-Training Paper • 2509.04419 • Published 15 days ago • 71
FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games Paper • 2509.01052 • Published 19 days ago • 19
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published 17 days ago • 192
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning Paper • 2508.20096 • Published 23 days ago • 36
Predicting the Order of Upcoming Tokens Improves Language Modeling Paper • 2508.19228 • Published 24 days ago • 21
Diffusion Language Models Know the Answer Before Decoding Paper • 2508.19982 • Published 23 days ago • 22
FastMesh: Efficient Artistic Mesh Generation via Component Decoupling Paper • 2508.19188 • Published 24 days ago • 15
Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels Paper • 2508.17437 • Published about 1 month ago • 35