btbtyler09
/

Qwen3-Coder-30B-A3B-Instruct-gptq-4bit

Text Generation

4-bit precision

Model card Files Files and versions

Qwen3-Coder-30B-A3B-Instruct-gptq-4bit / README.md

btbtyler09's picture

Update README.md

147d73a verified 3 months ago

|

history blame contribute delete

525 Bytes

	---
	base_model: Qwen/Qwen3-Coder-30B-A3B-Instruct
	library_name: transformers
	license_link: https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/LICENSE
	license: apache-2.0
	tags:
	- qwen3
	- qwen
	- gptq
	- 4 Bit
	---

	# 4-bit Quantization of the Qwen3 30B A3B Thinking 2507 Model
	Quantized using GPTQModel

	quantiziation config:
	```
	quant_config = QuantizeConfig(
	bits=4,
	group_size=32,
	sym=True,
	desc_act=False,
	true_sequential=True,
	pack_dtype=torch.int32,
	damp_percent=0.1
	)
	```