Update README.md
Browse files
README.md
CHANGED
@@ -56,7 +56,7 @@ python3 quantize_quark.py --model_dir "${MODEL}" \
|
|
56 |
--model_export hf_format \
|
57 |
--custom_mode fp8 \
|
58 |
--quant_algo autosmoothquant \
|
59 |
-
--exclude_layers "lm_head" "*.gate"
|
60 |
```
|
61 |
|
62 |
# Model Performance Comparison
|
|
|
56 |
--model_export hf_format \
|
57 |
--custom_mode fp8 \
|
58 |
--quant_algo autosmoothquant \
|
59 |
+
--exclude_layers "lm_head" "*.gate" "*.o_proj"
|
60 |
```
|
61 |
|
62 |
# Model Performance Comparison
|