Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Nebius AI
Cerebras
Novita
Together AI
Fireworks
fal
Groq
Featherless AI
+ 8
Apply filters
Models
6,614
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
iqbalamo93/gemma-3-12b-it-GGUF-q8_0
Image-Text-to-Text
•
12B
•
Updated
May 17
•
128
•
1
google/gemma-3n-E2B-it-litert-preview
Image-Text-to-Text
•
Updated
May 20
•
562
Hcompany/Holo1-7B
Image-Text-to-Text
•
8B
•
Updated
Jun 10
•
2k
•
222
mlabonne/gemma-3-12b-it-abliterated-v2-GGUF
Image-Text-to-Text
•
12B
•
Updated
May 29
•
6.22k
•
30
lmstudio-community/medgemma-4b-it-MLX-4bit
Image-Text-to-Text
•
0.9B
•
Updated
May 29
•
1.47k
•
2
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text
•
8B
•
Updated
Jun 7
•
19.8k
•
160
microsoft/GUI-Actor-7B-Qwen2.5-VL
Image-Text-to-Text
•
8B
•
Updated
Aug 9
•
1.33k
•
23
echo840/MonkeyOCR
Image-Text-to-Text
•
Updated
27 days ago
•
606
•
510
google/gemma-3n-E2B
Image-Text-to-Text
•
5B
•
Updated
Jul 14
•
2.86k
•
66
moonshotai/Kimi-VL-A3B-Thinking-2506
Image-Text-to-Text
•
16B
•
Updated
Aug 18
•
16.5k
•
307
SoybeanMilk/Kimi-VL-A3B-Thinking-2506-BNB-4bit
Image-Text-to-Text
•
9B
•
Updated
Jul 27
•
1.11k
•
10
Vchitect/ShotVL-7B
Image-Text-to-Text
•
8B
•
Updated
5 days ago
•
1.31k
•
14
unsloth/gemma-3n-E4B-it-GGUF
Image-Text-to-Text
•
7B
•
Updated
Jun 30
•
46.8k
•
162
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text
•
8B
•
Updated
Jul 11
•
32.5k
•
9
amine-khelif/MaVistral-GGUF
Image-Text-to-Text
•
24B
•
Updated
Jul 7
•
87
•
5
zai-org/GLM-4.1V-9B-Base
Image-Text-to-Text
•
10B
•
Updated
27 days ago
•
5.18k
•
54
baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle
Image-Text-to-Text
•
424B
•
Updated
Aug 19
•
1.07k
•
60
NCSOFT/VARCO-VISION-2.0-1.7B-OCR
Image-Text-to-Text
•
2B
•
Updated
9 days ago
•
6.18k
•
22
echo840/MonkeyOCR-pro-3B
Image-Text-to-Text
•
Updated
27 days ago
•
558
•
3
echo840/MonkeyOCR-pro-1.2B
Image-Text-to-Text
•
Updated
27 days ago
•
534
•
15
openbmb/MiniCPM-V-4-gguf
Image-Text-to-Text
•
4B
•
Updated
5 days ago
•
5.95k
•
40
openbmb/MiniCPM-V-4
Image-Text-to-Text
•
4B
•
Updated
9 days ago
•
37.6k
•
461
openbmb/MiniCPM-V-4-int4
Image-Text-to-Text
•
2B
•
Updated
9 days ago
•
718
•
6
nvidia/VideoITG-8B
Image-Text-to-Text
•
8B
•
Updated
Aug 13
•
166
•
7
allenai/olmOCR-7B-0725
Image-Text-to-Text
•
8B
•
Updated
28 days ago
•
8.91k
•
58
CohereLabs/command-a-vision-07-2025
Image-Text-to-Text
•
112B
•
Updated
Aug 2
•
53.2k
•
•
83
drwlf/MedraN-E4B
Image-Text-to-Text
•
8B
•
Updated
Aug 13
•
5
•
1
ducviet00/Florence-2-large-hf
Image-Text-to-Text
•
0.8B
•
Updated
Aug 18
•
2.99k
•
1
nicoboss/MedraN-E4B-Uncensored-EP7
Image-Text-to-Text
•
8B
•
Updated
Aug 13
•
11
•
2
XiaomiMiMo/MiMo-VL-7B-SFT-2508
Image-Text-to-Text
•
8B
•
Updated
Aug 21
•
5.34k
•
30
Previous
1
...
5
6
7
8
9
...
100
Next