Sergio Paniego PRO
AI & ML interests
Recent Activity
Organizations
-
Running41
comparevlms
🏃41Compare Vision Language Models
-
Running on Zero66
OCR Time Machine
📚66Extract text from images and XML files using OCR models
-
Running26
Compare Docvqa Models
🦀26Compare different visual question answering
-
Running on CPU Upgrade23
Compare Clip Siglip
🏃23Compare strong zero-shot image classification models
-
Qwen/Qwen2.5-Omni-7B
Any-to-Any • 11B • Updated • 170k • 1.83k -
RunningFeatured364
Qwen2.5 Omni 7B Demo
🏆364Generate text and speech responses from text, audio, images, or video input
-
Qwen2.5-Omni Technical Report
Paper • 2503.20215 • Published • 168 -
openbmb/MiniCPM-o-2_6
Any-to-Any • 9B • Updated • 103k • 1.27k
-
Running41
comparevlms
🏃41Compare Vision Language Models
-
Runtime error4
Gemma3 License Plate Detection
📈4Gemma 3 for license plate detection
-
Running on ZeroFeatured141
Gemma 3n E4B It
⚡141Generate text responses to images, videos, and audio
-
Running on ZeroFeatured34
Moondream3
🏢34Image and video tasks with moondream3.
-
Running41
comparevlms
🏃41Compare Vision Language Models
-
Running on Zero66
OCR Time Machine
📚66Extract text from images and XML files using OCR models
-
Running26
Compare Docvqa Models
🦀26Compare different visual question answering
-
Running on CPU Upgrade23
Compare Clip Siglip
🏃23Compare strong zero-shot image classification models
-
Running41
comparevlms
🏃41Compare Vision Language Models
-
Runtime error4
Gemma3 License Plate Detection
📈4Gemma 3 for license plate detection
-
Running on ZeroFeatured141
Gemma 3n E4B It
⚡141Generate text responses to images, videos, and audio
-
Running on ZeroFeatured34
Moondream3
🏢34Image and video tasks with moondream3.
-
Qwen/Qwen2.5-Omni-7B
Any-to-Any • 11B • Updated • 170k • 1.83k -
RunningFeatured364
Qwen2.5 Omni 7B Demo
🏆364Generate text and speech responses from text, audio, images, or video input
-
Qwen2.5-Omni Technical Report
Paper • 2503.20215 • Published • 168 -
openbmb/MiniCPM-o-2_6
Any-to-Any • 9B • Updated • 103k • 1.27k