turboderp's picture
Update README.md
980d516 verified
---
license: other
license_name: nvidia-open-model-license
license_link: >-
https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
---
EXL3 quants of [Llama-3.3-Nemotron-Super-49B-v1](https://huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1/tree/main)
[1.80 bits per weight / H4](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/1.8bpw_H4)
[2.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.0bpw)
[2.50 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.5bpw)
[3.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.0bpw)
[3.50 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.5bpw)
[4.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/4.0bpw)
[5.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/5.0bpw)
[6.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/6.0bpw)
[8.00 bits per weight / H8](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/8.0bpw_H8)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6383dc174c48969dcf1b4fce/PXwVukMFqjCcCuyaOg0YM.png)