nm-research commited on
Commit
c8b4ee2
·
verified ·
1 Parent(s): 6043ef6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -66,6 +66,10 @@ vLLM also supports OpenAI-compatible serving. See the [documentation](https://do
66
  This model was created with [llm-compressor](https://github.com/vllm-project/llm-compressor) by running the code snippet below.
67
 
68
 
 
 
 
 
69
  ```python
70
  import argparse
71
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
66
  This model was created with [llm-compressor](https://github.com/vllm-project/llm-compressor) by running the code snippet below.
67
 
68
 
69
+ ```bash
70
+ python quantize.py --model_id ibm-granite/granite-3.1-2b-base --save_path "output_dir/"
71
+ ```
72
+
73
  ```python
74
  import argparse
75
  from transformers import AutoModelForCausalLM, AutoTokenizer