Hugging Face
Models
28,169
Sort: Trending
Active filters:
8-bit
openai/gpt-oss-120b • Text Generation • 120B • Updated 13 days ago • 3M • 3.78k
openai/gpt-oss-20b • Text Generation • 22B • Updated 13 days ago • 8.93M • 3.44k
microsoft/bitnet-b1.58-2B-4T • Text Generation • 0.8B • Updated May 1 • 6.04k • 1.16k
jxm/gpt-oss-20b-base • Text Generation • 12B • Updated 19 days ago • 10.8k • 216
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF • Text Generation • 7B • Updated May 22, 2024 • 171k • 111
nvidia/DeepSeek-R1-0528-FP4-v2 • Text Generation • 394B • Updated 6 days ago • 14.4k • 3
mlx-community/Qwen3-30B-A3B-Instruct-2507-8bit • Text Generation • 31B • Updated Jul 29 • 76 • 2
lmstudio-community/gpt-oss-20b-MLX-8bit • Text Generation • 21B • Updated Aug 5 • 1.05M • 38
RedHatAI/gpt-oss-20b • Text Generation • 22B • Updated 3 days ago • 17 • 2
Lightricks/T5-XXL-8bit • 5B • Updated Feb 29, 2024 • 40 • 9
ragraph-ai/stable-cypher-instruct-3b • Text Generation • 3B • Updated Jun 12 • 587 • 27
MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF • Text Generation • 8B • Updated Sep 18, 2024 • 148k • 11
MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF • Text Generation • 1B • Updated Sep 25, 2024 • 153k • 16
AIFunOver/FLUX.1-dev-openvino-8bit • Text-to-Image • Updated Nov 18, 2024 • 2
MaziyarPanahi/Qwen2.5-Coder-0.5B-QwQ-draft-GGUF • Text Generation • 0.5B • Updated Jan 7 • 121 • 4
PrunaAI/migueldeguzmandev-paperclippetertodd3-bnb-8bit-smashed • 2B • Updated Jan 7 • 6 • 1
mlx-community/DeepSeek-R1-Distill-Llama-70B-8bit • 20B • Updated Feb 26 • 1.59k • 9
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8 • Image-to-Text • 8B • Updated Apr 3 • 3.5k • 6
mlx-community/YandexGPT-5-Lite-8B-pretrain-Q8-mlx • 2B • Updated Feb 26 • 48 • 3
yachty66/8-bit-quantized-catvton-flux • Updated Mar 11 • 10 • 1
MaziyarPanahi/gemma-3-1b-it-GGUF • Text Generation • 1.0B • Updated Mar 12 • 154k • 8
MaziyarPanahi/gemma-3-4b-it-GGUF • Text Generation • 4B • Updated Mar 12 • 151k • 11
tiiuae/Falcon-E-1B-Instruct • Text Generation • 0.5B • Updated Jul 10 • 1.26k • 9
ArtusDev/Delta-Vector_Archaeo-12B-V2_EXL2_8.0bpw_H8 • Text Generation • Updated May 20 • 9 • 1
mlx-community/Josiefied-Qwen3-14B-abliterated-v3-8bit • Text Generation • 4B • Updated Jun 4 • 73 • 1
mlx-community/Dolphin-Mistral-24B-Venice-Edition-mlx-8Bit • 7B • Updated Jun 19 • 276 • 3
evoreign/sea-lion-8b-mrl-embedding-merged • Feature Extraction • 8B • Updated Jul 1 • 59 • 1
codys12/Qwen3-8B-BitNet • Text Generation • 3B • Updated Jul 7 • 657 • 15
nightmedia/NextCoder-32B-q8-mlx • Text Generation • 33B • Updated Jul 11 • 12 • 1
baseten/Kimi-K2-Instruct-FP4 • 581B • Updated 13 days ago • 2.42k • 1