view article Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 4 days ago • 165
view article Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 27 days ago • 11
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated 2 days ago • 12
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • Aug 8 • 81
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • Aug 5 • 490
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • Aug 4 • 28
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • Jul 31 • 63
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • Jul 29 • 170
Dayhoff Atlas Collection The models and datasets that comprise the Dayhoff Atlas • 10 items • Updated Jul 28 • 8
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • Jul 25 • 80
view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others • Jul 23 • 39
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 667
SmolLM3 evaluation datasets Collection Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated Jul 8 • 5