Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Lemonade
Inference Providers
Select all
Cerebras
Together AI
Fireworks
Nebius AI
Novita
Groq
Hyperbolic
Nscale
fal
SambaNova
Featherless AI
Cohere
Replicate
HF Inference API
Misc
Reset Misc
arxiv:
2505.09388
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Mixture of Experts
Carbon Emissions
Apply filters
Models
418
Full-text search
Edit filters
Sort: Trending
Active filters:
2505.09388
Clear all
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation
•
31B
•
Updated
19 days ago
•
320k
•
•
569
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation
•
480B
•
Updated
19 days ago
•
236k
•
•
1.17k
Qwen/Qwen3-0.6B
Text Generation
•
0.8B
•
Updated
Jul 26
•
4.39M
•
•
604
Qwen/Qwen3-30B-A3B-Instruct-2507
Text Generation
•
31B
•
Updated
23 days ago
•
1.06M
•
•
546
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF
Text Generation
•
31B
•
Updated
Aug 8
•
185k
•
223
Qwen/Qwen3-4B-Thinking-2507
Text Generation
•
4B
•
Updated
Aug 6
•
224k
•
•
359
Qwen/Qwen3-4B-Instruct-2507
Text Generation
•
4B
•
Updated
Aug 6
•
1.03M
•
•
274
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation
•
235B
•
Updated
23 days ago
•
57.7k
•
•
345
Qwen/Qwen3-8B
Text Generation
•
8B
•
Updated
Jul 26
•
2.35M
•
•
587
Qwen/Qwen3-14B
Text Generation
•
15B
•
Updated
Jul 26
•
1.16M
•
•
264
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation
•
235B
•
Updated
23 days ago
•
92.5k
•
•
671
unsloth/Qwen3-4B-Instruct-2507-GGUF
4B
•
Updated
20 days ago
•
66.5k
•
60
Qwen/Qwen3-30B-A3B-Thinking-2507
Text Generation
•
31B
•
Updated
23 days ago
•
210k
•
•
261
unsloth/Qwen3-30B-A3B-Thinking-2507-GGUF
31B
•
Updated
Jul 31
•
46.3k
•
94
Qwen/Qwen3-32B
Text Generation
•
33B
•
Updated
Jul 26
•
909k
•
•
524
unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF
31B
•
Updated
Jul 31
•
81.4k
•
217
Qwen/Qwen3-1.7B
Text Generation
•
2B
•
Updated
Jul 26
•
926k
•
•
247
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation
•
235B
•
Updated
Jul 30
•
36.4k
•
59
Qwen/Qwen3-4B
Text Generation
•
4B
•
Updated
Jul 26
•
1.31M
•
•
377
Qwen/Qwen3-30B-A3B
Text Generation
•
31B
•
Updated
Jul 26
•
700k
•
•
774
Qwen/Qwen3-Coder-30B-A3B-Instruct-FP8
Text Generation
•
31B
•
Updated
19 days ago
•
85.9k
•
68
Qwen/Qwen3-4B-Thinking-2507-FP8
Text Generation
•
4B
•
Updated
Aug 6
•
180k
•
30
Qwen/Qwen3-235B-A22B
Text Generation
•
235B
•
Updated
Jul 26
•
147k
•
•
1.03k
Qwen/Qwen3-4B-MLX-4bit
Text Generation
•
0.6B
•
Updated
11 days ago
•
630
•
9
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation
•
235B
•
Updated
Jul 30
•
32k
•
121
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
•
480B
•
Updated
19 days ago
•
147k
•
•
111
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF
Text Generation
•
480B
•
Updated
Jul 31
•
19.7k
•
153
Qwen/Qwen3-30B-A3B-Instruct-2507-FP8
Text Generation
•
31B
•
Updated
Jul 29
•
91k
•
70
unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF
Text Generation
•
31B
•
Updated
Aug 5
•
29.2k
•
102
unsloth/Qwen3-4B-Thinking-2507-GGUF
4B
•
Updated
Aug 6
•
38.7k
•
43
Previous
1
2
3
...
14
Next