Update README.md
Browse files
README.md
CHANGED
@@ -60,5 +60,8 @@ response = generate(model, tokenizer, prompt=prompt, verbose=True)
|
|
60 |
```
|
61 |
|
62 |
Are you still reading down here? Really?
|
63 |
-
|
64 |
-
|
|
|
|
|
|
|
|
60 |
```
|
61 |
|
62 |
Are you still reading down here? Really?
|
63 |
+
|
64 |
+
Maybe use your OCD super powers to try this new Q4 lossless quant compression and tell us how to improve mlx-lm to get 4-bit speed at 8-bit quality.
|
65 |
+
https://huggingface.co/NexaAIDev/DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant
|
66 |
+
|
67 |
+
ore
|