Subliminal Learning: Language models transmit behavioral traits via hidden signals in data Paper • 2507.14805 • Published Jul 20 • 2
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2 • 180
Diffusion Language Models Know the Answer Before Decoding Paper • 2508.19982 • Published 12 days ago • 22
StepWiser: Stepwise Generative Judges for Wiser Reasoning Paper • 2508.19229 • Published 13 days ago • 19
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 49
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 60
The Case for Co-Designing Model Architectures with Hardware Paper • 2401.14489 • Published Jan 25, 2024 • 4
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published 14 days ago • 182
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models Paper • 1910.02054 • Published Oct 4, 2019 • 7
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published 20 days ago • 36
Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine Intelligence Paper • 2306.02572 • Published Jun 5, 2023 • 1
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines Paper • 2310.03714 • Published Oct 5, 2023 • 36
Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning Paper • 2508.09726 • Published 27 days ago • 13
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning Paper • 2508.08221 • Published 28 days ago • 45