clowman commited on
Commit
3f16a1e
·
verified ·
1 Parent(s): fa30a27

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -16
README.md CHANGED
@@ -1,19 +1,3 @@
1
- # Quantization
2
- Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
3
-
4
- Base Model: [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)
5
-
6
- Quantized using [llmcompressor==0.4.1](https://github.com/vllm-project/llm-compressor)
7
-
8
- Steps to create:
9
- 1. `git clone https://github.com/LambdaLabsML/lambda-quant`
10
- 2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
11
- 3. `python quantize.py -m meta-llama/Llama-3.2-3B-Instruct -q Dynamic-F8`
12
- ## Evaluation
13
- TODO
14
- ## Benchmarks
15
- TODO
16
- # Base Model README.md
17
  ---
18
  language:
19
  - en
@@ -234,6 +218,26 @@ extra_gated_description: >-
234
  Policy](https://www.facebook.com/privacy/policy/).
235
  extra_gated_button_content: Submit
236
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
237
 
238
  ## Model Information
239
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  language:
3
  - en
 
218
  Policy](https://www.facebook.com/privacy/policy/).
219
  extra_gated_button_content: Submit
220
  ---
221
+ # Quantization
222
+ Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
223
+
224
+ Base Model: [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)
225
+
226
+ Quantized using [llmcompressor==0.4.1](https://github.com/vllm-project/llm-compressor)
227
+
228
+ Steps to create:
229
+ 1. `git clone https://github.com/LambdaLabsML/lambda-quant`
230
+ 2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
231
+ 3. `python quantize.py -m meta-llama/Llama-3.2-3B-Instruct -q Dynamic-F8`
232
+
233
+ ## Evaluation
234
+ TODO
235
+
236
+ ## Benchmarks
237
+ TODO
238
+
239
+ # Base Model README.md
240
+
241
 
242
  ## Model Information
243