huggingkot commited on
Commit
9ae71e7
·
1 Parent(s): 2e46495
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ base_model:
4
+ - SicariusSicariiStuff/LLAMA-3_8B_Unaligned_BETA
5
+ ---
6
+
7
+ This is a converted weight from [LLAMA-3_8B_Unaligned_BETA](https://huggingface.co/SicariusSicariiStuff/LLAMA-3_8B_Unaligned_BETA) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
8
+
9
+ ## About this Conversion
10
+
11
+ This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
12
+
13
+ ### How 4-bit Quantization Works
14
+ - The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
15
+ - **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
16
+
17
+ This allows for reduced memory usage and faster inference while keeping the model compact.
config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fb826f8069f22078b94369bafa0286e5e4a8bb88eeff50374cb7d70b97ddef5d
3
+ size 1204
generation_config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b365b7f465a08f54ee2a01a3917c55d66205403992f61bec30f8d874de1430b0
3
+ size 192
model-00001-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:57aa382dcd717ebcddc12818538fbe63b40655444a0916e32599ed09a524f99a
3
+ size 4652089254
model-00002-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4cff908a6a66fc99d2587657ed1733ebdebffcc5e16d6b2a04dc2d50d57cd946
3
+ size 1050689664
model.safetensors.index.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4786d0cf364371a16c7213be632161e6e0945ff0b802bcf99aecbca95b16147
3
+ size 132271
special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2df2c4620bb1a9eb877bc7c90c7fa04608bda9fa7c0cf2cdcc0a17b849649683
3
+ size 444
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee14311493150155c133fd76a8abd9207afc2fe851da5fadd5e05cbd084252ea
3
+ size 17210338
tokenizer_config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c42410c1416b889d7cc29e1055281bfcf65c233451c96c5d48d796beaec0802
3
+ size 51299