Upload complete model
Browse files
README.md
CHANGED
|
@@ -20,7 +20,8 @@ base_model: openai/gpt-oss-120b
|
|
| 20 |
|
| 21 |
## Usage Notes
|
| 22 |
|
| 23 |
-
*
|
| 24 |
* Memory usage: ~95 GB (down from ~251GB required by native MXFP4 format)
|
| 25 |
* Expect ~60 tokens/s
|
|
|
|
| 26 |
* For more details see [demonstration video](https://youtu.be/mlpFG8e_fLw) or visit [OpenAI gpt-oss-20b](https://huggingface.co/openai/gpt-oss-120b).
|
|
|
|
| 20 |
|
| 21 |
## Usage Notes
|
| 22 |
|
| 23 |
+
* Tested to run with [Inferencer app](https://inferencer.com)
|
| 24 |
* Memory usage: ~95 GB (down from ~251GB required by native MXFP4 format)
|
| 25 |
* Expect ~60 tokens/s
|
| 26 |
+
* Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
|
| 27 |
* For more details see [demonstration video](https://youtu.be/mlpFG8e_fLw) or visit [OpenAI gpt-oss-20b](https://huggingface.co/openai/gpt-oss-120b).
|