Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 4 days ago • 165
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning Paper • 2508.19828 • Published 12 days ago • 4
AWorld: Orchestrating the Training Recipe for Agentic AI Paper • 2508.20404 • Published 11 days ago • 37
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 9 items • Updated 6 days ago • 90
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published 17 days ago • 133
Clio: Privacy-Preserving Insights into Real-World AI Use Paper • 2412.13678 • Published Dec 18, 2024 • 1
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published 19 days ago • 80
Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 21 days ago • 54
Article MCP for Research: How to Connect AI to Research Tools By dylanebert • 21 days ago • 45
Conformal Prediction of Classifiers with Many Classes based on Noisy Labels Paper • 2501.12749 • Published Jan 22 • 1
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning Paper • 2508.08221 • Published 28 days ago • 45