TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 11 days ago • 22 • 4
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 19 days ago • 59 • 9
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 19 days ago • 59 • 9
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 244 • 6
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights Paper • 2512.01816 • Published Dec 1, 2025 • 88 • 5
AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems Paper • 2510.05432 • Published Oct 6, 2025 • 6 • 4
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 259 • 6
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14, 2025 • 306 • 10
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper • 2503.07459 • Published Mar 10, 2025 • 16 • 3
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks Paper • 2503.06885 • Published Mar 10, 2025 • 4 • 3
MinorBench: A hand-built benchmark for content-based risks for children Paper • 2503.10242 • Published Mar 13, 2025 • 5 • 3
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper • 2503.07459 • Published Mar 10, 2025 • 16 • 3
FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation Paper • 2503.06680 • Published Mar 9, 2025 • 20 • 7
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Paper • 2503.10291 • Published Mar 13, 2025 • 36 • 3
Charting and Navigating Hugging Face's Model Atlas Paper • 2503.10633 • Published Mar 13, 2025 • 92 • 6
Charting and Navigating Hugging Face's Model Atlas Paper • 2503.10633 • Published Mar 13, 2025 • 92 • 6
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12, 2025 • 74 • 3