nvidia/llama-nemoretriever-colembed-3b-v1 Visual Document Retrieval • 4B • Updated 1 day ago • 844 • 66
SAM Audio Collection The SAM Audio model licenses allow for redistribution so long as the original license files are included • 9 items • Updated 6 days ago • 1
ViDoRe Benchmark V3 Collection ViDoRe V3 is our latest benchmark, engineered to set a new industry gold standard for multi-modal, enterprise document retrieval evaluation. • 8 items • Updated Nov 5 • 16
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5 • 57
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 306
SWE-Playground Collection Official Collection for "Training Versatile Coding Agents in Synthetic Environments" • 11 items • Updated Nov 22 • 2
Running on A100 216 Omnilingual ASR Media Transcription 🌍 216 Transcribe audio or video into text in any language
Running on Zero 25 Kartoffel-TTS (Based on Chatterbox) - German Text-to-Speech Demo 📢 25 Expressive Zeroshot TTS