VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs Paper • 2512.22342 • Published 4 days ago • 8
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection Paper • 2512.23273 • Published 1 day ago • 7
OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding Paper • 2512.23646 • Published 1 day ago • 8
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling Paper • 2512.23162 • Published 2 days ago • 7
An Information Theoretic Perspective on Agentic System Design Paper • 2512.21720 • Published 5 days ago • 6
Act2Goal: From World Model To General Goal-conditioned Policy Paper • 2512.23541 • Published 1 day ago • 19
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published 1 day ago • 32
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper • 2512.22047 • Published 4 days ago • 25
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation Paper • 2512.19680 • Published 8 days ago • 9
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models Paper • 2512.19995 • Published 8 days ago • 14
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 7 days ago • 56
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 13 days ago • 28
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning Paper • 2512.22120 • Published 4 days ago • 12
ProEdit: Inversion-based Editing From Prompts Done Right Paper • 2512.22118 • Published 4 days ago • 14
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 5 days ago • 17
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture Paper • 2512.21675 • Published 5 days ago • 22
Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding Paper • 2512.17220 • Published 12 days ago • 85