Update README.md
Browse files
README.md
CHANGED
@@ -1,19 +1,3 @@
|
|
1 |
-
# Quantization
|
2 |
-
Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
|
3 |
-
|
4 |
-
Base Model: [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)
|
5 |
-
|
6 |
-
Quantized using [llmcompressor==0.4.1](https://github.com/vllm-project/llm-compressor)
|
7 |
-
|
8 |
-
Steps to create:
|
9 |
-
1. `git clone https://github.com/LambdaLabsML/lambda-quant`
|
10 |
-
2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
|
11 |
-
3. `python quantize.py -m meta-llama/Llama-3.2-3B-Instruct -q Dynamic-F8`
|
12 |
-
## Evaluation
|
13 |
-
TODO
|
14 |
-
## Benchmarks
|
15 |
-
TODO
|
16 |
-
# Base Model README.md
|
17 |
---
|
18 |
language:
|
19 |
- en
|
@@ -234,6 +218,26 @@ extra_gated_description: >-
|
|
234 |
Policy](https://www.facebook.com/privacy/policy/).
|
235 |
extra_gated_button_content: Submit
|
236 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
237 |
|
238 |
## Model Information
|
239 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
language:
|
3 |
- en
|
|
|
218 |
Policy](https://www.facebook.com/privacy/policy/).
|
219 |
extra_gated_button_content: Submit
|
220 |
---
|
221 |
+
# Quantization
|
222 |
+
Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
|
223 |
+
|
224 |
+
Base Model: [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)
|
225 |
+
|
226 |
+
Quantized using [llmcompressor==0.4.1](https://github.com/vllm-project/llm-compressor)
|
227 |
+
|
228 |
+
Steps to create:
|
229 |
+
1. `git clone https://github.com/LambdaLabsML/lambda-quant`
|
230 |
+
2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
|
231 |
+
3. `python quantize.py -m meta-llama/Llama-3.2-3B-Instruct -q Dynamic-F8`
|
232 |
+
|
233 |
+
## Evaluation
|
234 |
+
TODO
|
235 |
+
|
236 |
+
## Benchmarks
|
237 |
+
TODO
|
238 |
+
|
239 |
+
# Base Model README.md
|
240 |
+
|
241 |
|
242 |
## Model Information
|
243 |
|