Update README.md
README.md
CHANGED
@@ -14,12 +14,12 @@ pipeline_tag: text-generation
 
 | Filename | Quant type | File Size | Vram at 16k context|
 | -------- | ---------- | --------- |
-
-
-
-
-
-
+| [phi-4_hb8_3bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_3bpw) | 3.00 bits per weight | 6.66 GB | **10,3 GB** |
+| [phi-4_hb8_4bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_4bpw) | 4.00 bits per weight | 8.36 GB | **11,9 GB** |
+| [phi-4_hb8_5bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_5bpw) | 5.00 bits per weight | 10.1 GB | **13,5 GB** |
+| [phi-4_hb8_6bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_6bpw) | 6.00 bits per weight | 11.8 GB | **15,1 GB** |
+| [phi-4_hb8_7bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_7bpw) | 7.00 bits per weight | 13.5 GB | **16,7 GB** |
+| [phi-4_hb8_8bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_8bpw) | 8.00 bits per weight | 15.2 GB | **18,2 GB** |
 
 # Phi-4 Model Card
 
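
The rows added in this change can be used to choose a quant for a given GPU. The following is a minimal sketch, not part of the README: the helper name `pick_quant` and the `headroom_gb` parameter are hypothetical, and it assumes the decimal commas in the VRAM column (for example "10,3 GB") mean 10.3 GB.

```python
# Rows transcribed from the added table lines:
# (branch name, bits per weight, file size in GB, VRAM in GB at 16k context).
# Assumption: "10,3 GB" in the table is read as 10.3 GB (decimal comma).
QUANTS = [
    ("hb8_3bpw", 3.0, 6.66, 10.3),
    ("hb8_4bpw", 4.0, 8.36, 11.9),
    ("hb8_5bpw", 5.0, 10.1, 13.5),
    ("hb8_6bpw", 6.0, 11.8, 15.1),
    ("hb8_7bpw", 7.0, 13.5, 16.7),
    ("hb8_8bpw", 8.0, 15.2, 18.2),
]


def pick_quant(vram_budget_gb, headroom_gb=0.0):
    """Return the branch of the highest-bpw quant whose listed VRAM
    (plus optional headroom) fits the budget, or None if none fits."""
    fitting = [q for q in QUANTS if q[3] + headroom_gb <= vram_budget_gb]
    if not fitting:
        return None
    # Highest bits per weight among the quants that fit.
    return max(fitting, key=lambda q: q[1])[0]


if __name__ == "__main__":
    print(pick_quant(12.0))
    print(pick_quant(24.0))
```

The returned string matches the branch segment of the table links (`.../tree/hb8_4bpw`), so it could be passed as the `revision` argument of `huggingface_hub.snapshot_download` for the `cmh/phi-4_exl2` repository. The VRAM figures apply only at 16k context, so a different context length would need its own measurements.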