---
license: other
license_name: nvidia-open-model-license
license_link: >-
  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
---

EXL3 quants of [Llama-3.3-Nemotron-Super-49B-v1](https://huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1/tree/main)

[1.80 bits per weight / H4](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/1.8bpw_H4)

[2.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.0bpw)

[2.50 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.5bpw)

[3.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.0bpw)

[3.50 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.5bpw)

[4.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/4.0bpw)

[5.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/5.0bpw)

[6.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/6.0bpw)

[8.00 bits per weight / H8](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/8.0bpw_H8)
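
Each quant listed above lives on its own branch of the repository, so fetching one means asking for that branch as the revision. The following is a minimal sketch of one way to do that, assuming the third-party `huggingface_hub` package is installed. The helper names `revision_for` and `download_quant` are hypothetical and exist only for this illustration; the repository ID and branch names are copied from the links above.

```python
from typing import Optional

# Repository and branch names as they appear in the links above.
REPO_ID = "turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3"

# Bits per weight -> branch name. Note that the 1.8 and 8.0 branches
# carry an extra suffix (H4 / H8) in their names.
REVISIONS = {
    1.8: "1.8bpw_H4",
    2.0: "2.0bpw",
    2.5: "2.5bpw",
    3.0: "3.0bpw",
    3.5: "3.5bpw",
    4.0: "4.0bpw",
    5.0: "5.0bpw",
    6.0: "6.0bpw",
    8.0: "8.0bpw_H8",
}


def revision_for(bpw: float) -> str:
    """Return the branch name for a listed bits-per-weight value (hypothetical helper)."""
    try:
        return REVISIONS[bpw]
    except KeyError:
        listed = ", ".join(str(k) for k in sorted(REVISIONS))
        raise ValueError(f"No quant listed for {bpw} bpw; choose one of: {listed}") from None


def download_quant(bpw: float, local_dir: Optional[str] = None) -> str:
    """Download one quant branch and return the local path (hypothetical helper)."""
    # Imported lazily so the lookup above works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=revision_for(bpw),
        local_dir=local_dir,  # None keeps the files in the default Hugging Face cache
    )
```

For example, `download_quant(4.0)` would fetch the `4.0bpw` branch into the local Hugging Face cache and return the directory it was stored in. These downloads are large, so it may be worth checking free disk space first.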