amd/Mixtral-8x7B-Instruct-v0.1_FP8_MLPerf_V3

linzhao-amd committed on Jul 28

Commit aef98c2 · verified · 1 parent: 464058f

Update README.md

Files changed (1)

README.md +1 -1

@@ -56,7 +56,7 @@ python3 quantize_quark.py --model_dir "${MODEL}" \
                           --model_export hf_format \
                           --custom_mode fp8 \
                           --quant_algo autosmoothquant \
-                          --exclude_layers "lm_head" "*.gate"
+                          --exclude_layers "lm_head" "*.gate" "*.o_proj"
 ```
 # Model Performance Comparison
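
The only change in this commit is the extra `"*.o_proj"` pattern passed to `--exclude_layers`. As a rough illustration of what that likely affects, the sketch below uses Python's standard `fnmatch` module to compare the old and new pattern lists against a handful of layer names. Two things here are assumptions rather than facts taken from the README: that the patterns are matched glob-style against full module names, and that the names follow the usual Hugging Face Mixtral layout. The `layer_names` list and the `excluded` helper are hypothetical and exist only for this example; they are not part of `quantize_quark.py`.

```python
from fnmatch import fnmatchcase

# Hypothetical module names, modeled on the Hugging Face Mixtral layout (assumption).
layer_names = [
    "lm_head",
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.self_attn.o_proj",
    "model.layers.0.block_sparse_moe.gate",
    "model.layers.0.block_sparse_moe.experts.0.w1",
]

# Pattern lists copied from the removed and added diff lines.
old_patterns = ["lm_head", "*.gate"]
new_patterns = ["lm_head", "*.gate", "*.o_proj"]


def excluded(names, patterns):
    """Return the names matched by at least one glob pattern, in input order."""
    return [n for n in names if any(fnmatchcase(n, p) for p in patterns)]


# Layers that are excluded only after this commit.
newly_excluded = sorted(
    set(excluded(layer_names, new_patterns)) - set(excluded(layer_names, old_patterns))
)
print(newly_excluded)
```

Under those assumptions, the attention output projections are the only layers that move from quantized to excluded; the `lm_head` and MoE router `gate` exclusions are unchanged.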