view article Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 4 days ago • 150
Built with Distill blog ❤️ Collection Collection of all interactive blogs built on top of Distill template. To create your own check: https://huggingface.co/spaces/lvwerra/distill-blog-tem • 6 items • Updated Mar 14 • 2
view article Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • 6 days ago • 40
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 9 items • Updated 5 days ago • 88
SuryaBench Collection Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction • 8 items • Updated 20 days ago • 5
view article Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • about 1 month ago • 59
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 21 days ago • 54
NVIDIA Nemotron Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 4 items • Updated 4 days ago • 56
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 70
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 17 days ago • 277
Technical Report: Full-Stack Fine-Tuning for the Q Programming Language Paper • 2508.06813 • Published 29 days ago • 5
qqWen-Series Collection Based off the Qwen-2.5 Series - model finetuned for the Q programming language. • 11 items • Updated 10 days ago • 10
Aryabhata: An exam-focused language model for JEE Math Paper • 2508.08665 • Published 26 days ago • 16