Update README.md
README.md CHANGED

````diff
@@ -23,7 +23,6 @@ CodeFuse-CodeLlama-34B-4bits is the 4-bit quantized version of CodeFuse-CodeLlam
 
 After undergoing 4-bit quantization, the CodeFuse-CodeLlama-34B-4bits model can be loaded on either a single A10 (24GB VRAM) or an RTX 4090 (24GB VRAM). Moreover, the quantized model still achieves an impressive accuracy of 73.8% on the HumanEval pass@1 metric.
 
-
 <br>
 
 ## News and Updates
````
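The context line above states that the 4-bit model fits on a single 24 GB card. A back-of-the-envelope check makes the arithmetic concrete; this is a sketch only, assuming 34B parameters at 4 bits each and ignoring KV-cache, activations, and quantization metadata, which add real overhead on top of the weights:

```python
# Rough VRAM estimate for the 4-bit quantized weights alone.
# Assumptions (not from the diff): 34e9 parameters, 4 bits = 0.5 bytes each;
# actual usage also needs room for KV-cache, activations, and metadata.
params = 34e9
bytes_per_param = 0.5  # 4 bits
weight_gib = params * bytes_per_param / 1024**3
print(f"weights ≈ {weight_gib:.1f} GiB")  # ≈ 15.8 GiB, under a 24 GB card
```

This is why the quantized checkpoint fits where the fp16 weights (roughly four times larger) would not.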
````diff
@@ -205,8 +204,6 @@ Here, SHA256 values are provided for the model-related files for consistency che
 |tokenizer.model | 9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347 |
 |tokenizer_config.json | c12441e82f2dce0baff87cf5948e82d6e9b51cc0b5266369c30c319fb771eeb2 |
 
-
-<br>
 <br>
 
 ## Citation
````
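The hunk above only trims whitespace after the SHA256 table, but for readers actually doing the consistency check the README describes, a minimal sketch (the streaming helper and chunk size are illustrative, not part of the repository):

```python
import hashlib

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream a file in chunks and return its hex SHA256 digest,
    for comparison against the values in the README's table."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()

# e.g. sha256_of("tokenizer.model") should equal the table entry
# 9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
```

Streaming in chunks keeps memory flat even for multi-gigabyte model shards, unlike reading the whole file at once.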
````diff
@@ -221,6 +218,7 @@ If you find our [work](https://arxiv.org/abs/2311.02303) useful or helpful for y
 eprint={2311.02303}
 }
 ```
+<br>
 
 <a id="chinese"></a>
 
````