mlx-community
/

FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview-Q8

8-bit precision

Model card Files Files and versions

bobig commited on Feb 20

Commit

7f20c2d

·

verified ·

1 Parent(s): 28a4512

Update README.md

Files changed (1) hide show

README.md +5 -2

README.md CHANGED Viewed

@@ -60,5 +60,8 @@ response = generate(model, tokenizer, prompt=prompt, verbose=True)
 ```
 Are you still reading down here?  Really?
-Ok, try this new Q4 lossless quant compression and tell us how to improve mlx-lm for 4-bit speed at 8-bit quality.
-https://huggingface.co/NexaAIDev/DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant

 ```
 Are you still reading down here?  Really?
+Maybe use your OCD super powers to try this new Q4 lossless quant compression and tell us how to improve mlx-lm to get 4-bit speed at 8-bit quality.
+https://huggingface.co/NexaAIDev/DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant
+ore