inferencerlabs commited on
Commit
6515609
·
verified ·
1 Parent(s): fa92bd9

Upload complete model

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -20,7 +20,8 @@ base_model: openai/gpt-oss-120b
20
 
21
  ## Usage Notes
22
 
23
- * Built with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
24
  * Memory usage: ~95 GB (down from ~251GB required by native MXFP4 format)
25
  * Expect ~60 tokens/s
 
26
  * For more details see [demonstration video](https://youtu.be/mlpFG8e_fLw) or visit [OpenAI gpt-oss-20b](https://huggingface.co/openai/gpt-oss-120b).
 
20
 
21
  ## Usage Notes
22
 
23
+ * Tested to run with [Inferencer app](https://inferencer.com)
24
  * Memory usage: ~95 GB (down from ~251GB required by native MXFP4 format)
25
  * Expect ~60 tokens/s
26
+ * Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
27
  * For more details see [demonstration video](https://youtu.be/mlpFG8e_fLw) or visit [OpenAI gpt-oss-20b](https://huggingface.co/openai/gpt-oss-120b).