Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

6,959

Full-text search

Active filters: gptq

baichuan-inc/Baichuan-M2-32B-GPTQ-Int4

Text Generation • 6B • Updated 6 days ago • 1.57k • 9

QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4

Text Generation • 6B • Updated 4 days ago • 461 • 4

TheBloke/phi-2-GPTQ

Text Generation • 0.6B • Updated Dec 18, 2023 • 2.48k • 30

RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit

Text Generation • 1B • Updated Jun 10, 2024 • 52.5k • 21

hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4

Text Generation • 2B • Updated Aug 7, 2024 • 18.4k • 38

shuyuej/Llama-3.2-1B-Instruct-GPTQ

0.4B • Updated Sep 25, 2024 • 8.9k • 4

openbmb/MiniCPM-o-2_6-int4

Any-to-Any • Updated 6 days ago • 2.35k • 51

empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit

Text Generation • 6B • Updated Feb 16 • 1.38k • 4

Qwen/Qwen3-30B-A3B-GPTQ-Int4

Text Generation • 5B • Updated May 21 • 113k • 26

Qwen/Qwen2.5-Omni-7B-GPTQ-Int4

Any-to-Any • 5B • Updated May 15 • 265 • 10

orvp/gemma-3-27b-it-gptq

5B • Updated May 21 • 1.11k • 2

QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact

Text Generation • Updated Jun 19 • 1.26k • 4

AlphaGaO/UIGEN-X-8B-GPTQ

Text Generation • 2B • Updated Jul 18 • 23 • 1

QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix

Text Generation • Updated 20 days ago • 609 • 1

QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8

Text Generation • 8B • Updated 4 days ago • 3.84k • 6

QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3

Text Generation • 5B • Updated 4 days ago • 69 • 2

JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8

Text Generation • 1B • Updated 5 days ago • 54 • 1

groxaxo/Qwen3-32B-AWorld-W8A16

9B • Updated 6 days ago • 33 • 1

groxaxo/Huihui-gpt-oss-20b-BF16-abliterated-W8A16

20B • Updated 6 days ago • 8 • 1

groxaxo/gpt-oss-20b-ShiningValiant3-W8A16

Text Generation • 20B • Updated 5 days ago • 14 • 1

openbmb/MiniCPM4.1-8B-Marlin

Text Generation • Updated 3 days ago • 11 • 1

elinas/alpaca-13b-lora-int4

Text Generation • Updated Apr 5, 2023 • 8 • 41

elinas/alpaca-30b-lora-int4

Text Generation • Updated Apr 5, 2023 • 11 • 68

mayaeary/pygmalion-6b-4bit-128g

Text Generation • Updated Mar 28, 2023 • 29 • 40

mayaeary/pygmalion-6b_dev-4bit-128g

Text Generation • Updated Mar 28, 2023 • 18 • 120

mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g

Text Generation • Updated Mar 31, 2023 • 7 • 2

mayaeary/PPO_Pygway-6b-Mix-4bit-128g

Text Generation • Updated Mar 31, 2023 • 6 • 2

elinas/vicuna-13b-4bit

Text Generation • Updated Apr 5, 2023 • 5 • 45

TheBloke/koala-7B-GPTQ

Text Generation • 1B • Updated Aug 21, 2023 • 30 • 31

TheBloke/koala-7B-HF

Text Generation • Updated Jun 5, 2023 • 1.83k • 21