Papers - a TheOneTrueNiz Collection

updated 6 days ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published 12 days ago • 105

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Paper • 2508.16949 • Published 17 days ago • 22

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published 12 days ago • 73

UItron: Foundational GUI Agent with Advanced Perception and Planning

Paper • 2508.21767 • Published 11 days ago • 12

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 7 days ago • 79