-
-
-
-
-
-
Inference Providers
Active filters:
gptq
baichuan-inc/Baichuan-M2-32B-GPTQ-Int4
Text Generation
•
6B
•
Updated
•
1.57k
•
9
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
•
6B
•
Updated
•
461
•
4
TheBloke/phi-2-GPTQ
Text Generation
•
0.6B
•
Updated
•
2.48k
•
30
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
1B
•
Updated
•
52.5k
•
21
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
2B
•
Updated
•
18.4k
•
38
shuyuej/Llama-3.2-1B-Instruct-GPTQ
0.4B
•
Updated
•
8.9k
•
4
openbmb/MiniCPM-o-2_6-int4
Any-to-Any
•
Updated
•
2.35k
•
51
empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
Text Generation
•
6B
•
Updated
•
1.38k
•
4
Qwen/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
5B
•
Updated
•
113k
•
26
Qwen/Qwen2.5-Omni-7B-GPTQ-Int4
Any-to-Any
•
5B
•
Updated
•
265
•
10
orvp/gemma-3-27b-it-gptq
5B
•
Updated
•
1.11k
•
2
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
•
Updated
•
1.26k
•
4
AlphaGaO/UIGEN-X-8B-GPTQ
Text Generation
•
2B
•
Updated
•
23
•
1
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
•
Updated
•
609
•
1
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
3.84k
•
6
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
•
5B
•
Updated
•
69
•
2
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
•
1B
•
Updated
•
54
•
1
groxaxo/Qwen3-32B-AWorld-W8A16
9B
•
Updated
•
33
•
1
groxaxo/Huihui-gpt-oss-20b-BF16-abliterated-W8A16
20B
•
Updated
•
8
•
1
groxaxo/gpt-oss-20b-ShiningValiant3-W8A16
Text Generation
•
20B
•
Updated
•
14
•
1
openbmb/MiniCPM4.1-8B-Marlin
Text Generation
•
Updated
•
11
•
1
elinas/alpaca-13b-lora-int4
Text Generation
•
Updated
•
8
•
41
elinas/alpaca-30b-lora-int4
Text Generation
•
Updated
•
11
•
68
mayaeary/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
29
•
40
mayaeary/pygmalion-6b_dev-4bit-128g
Text Generation
•
Updated
•
18
•
120
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g
Text Generation
•
Updated
•
7
•
2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g
Text Generation
•
Updated
•
6
•
2
elinas/vicuna-13b-4bit
Text Generation
•
Updated
•
5
•
45
TheBloke/koala-7B-GPTQ
Text Generation
•
1B
•
Updated
•
30
•
31
TheBloke/koala-7B-HF
Text Generation
•
Updated
•
1.83k
•
21