Qwen3-1.7B-W8A8 / recipe.yaml
default_stage:
  default_modifiers:
    SmoothQuantModifier: {smoothing_strength: 0.8}
    GPTQModifier:
      targets: [Linear]
      ignore: [lm_head]
      scheme: W8A8
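
The recipe pairs a SmoothQuant pass (`smoothing_strength: 0.8`) with 8-bit weight and activation quantization (`W8A8`). As a rough illustration of what those two settings mean, the following is a minimal pure-Python sketch, not the library's implementation. It assumes the standard SmoothQuant per-channel scale `s_j = max|X_j|^alpha / max|W_j|^(1 - alpha)` and uses plain round-to-nearest symmetric int8 quantization in place of GPTQ's error-compensated rounding. The function names and the sample statistics are hypothetical.

```python
# Sketch only: illustrates the ideas behind `smoothing_strength` and int8
# quantization. It is NOT how the modifiers in the recipe are implemented.

def smoothing_scales(act_absmax, weight_absmax, alpha=0.8):
    """Per-input-channel scales s_j = max|X_j|**alpha / max|W_j|**(1 - alpha)."""
    return [a ** alpha / w ** (1.0 - alpha)
            for a, w in zip(act_absmax, weight_absmax)]


def quantize_int8(values):
    """Symmetric per-tensor int8 quantization (round-to-nearest, not GPTQ)."""
    absmax = max(abs(v) for v in values)
    scale = absmax / 127.0 if absmax > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


# Hypothetical per-channel absolute maxima, made up for illustration.
act_absmax = [16.0, 1.0]      # channel 0 has an activation outlier
weight_absmax = [1.0, 1.0]

scales = smoothing_scales(act_absmax, weight_absmax, alpha=0.8)

# Activations are divided by s_j and weights multiplied by s_j, so each
# channel's product is unchanged while the activation range shrinks.
smoothed_act = [a / s for a, s in zip(act_absmax, scales)]
smoothed_wgt = [w * s for w, s in zip(weight_absmax, scales)]

codes, scale = quantize_int8([0.5, -1.0, 0.25])
```

With `alpha=0.8`, most of the outlier's magnitude moves from the activations into the weights; a lower value would leave more of it on the activation side.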