Symbolic Graphics Programming with Large Language Models Paper • 2509.05208 • Published 3 days ago • 28
MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting Paper • 2509.03800 • Published 5 days ago • 3
LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation Paper • 2509.05263 • Published 3 days ago • 6
WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning Paper • 2509.04744 • Published 4 days ago • 8
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth Paper • 2509.03867 • Published 5 days ago • 190
Towards a Unified View of Large Language Model Post-Training Paper • 2509.04419 • Published 4 days ago • 58
Instruction-Following Evaluation for Large Language Models Paper • 2311.07911 • Published Nov 14, 2023 • 22
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions? Paper • 2509.04292 • Published 4 days ago • 49
MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement Paper • 2509.01977 • Published 7 days ago • 11
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots Paper • 2509.02530 • Published 6 days ago • 7
Drawing2CAD: Sequence-to-Sequence Learning for CAD Generation from Vector Drawings Paper • 2508.18733 • Published 14 days ago • 7
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks Paper • 2509.01396 • Published 7 days ago • 50
Planning with Reasoning using Vision Language World Model Paper • 2509.02722 • Published 6 days ago • 15
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation Paper • 2509.00428 • Published 10 days ago • 15
Robix: A Unified Model for Robot Interaction, Reasoning and Planning Paper • 2509.01106 • Published 8 days ago • 43
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published 6 days ago • 164
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper • 2509.02544 • Published 6 days ago • 107
ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding Paper • 2508.21496 • Published 10 days ago • 53