clowman/Llama-3.2-3B-Instruct-GPTQ-Int8
Text Generation
Transformers
Safetensors
PyTorch
8 languages
llama
facebook
meta
llama-3
conversational
text-generation-inference
8-bit precision
gptq
arxiv:2204.05149
arxiv:2405.16406
License: llama3.2
Branch: main
1 contributor
History: 3 commits
Latest commit: clowman, "Update README.md" (1dcb6e2, verified), 6 months ago
File                           Scan  Size       Storage  Last commit                          Updated
.gitattributes                 Safe  1.57 kB    -        Upload folder using huggingface_hub  6 months ago
README.md                      Safe  42.2 kB    -        Update README.md                     6 months ago
USE_POLICY.md                  Safe  6.02 kB    -        Upload folder using huggingface_hub  6 months ago
args-lambda-quant.json         Safe  258 Bytes  -        Upload folder using huggingface_hub  6 months ago
config.json                    Safe  1.52 kB    -        Upload folder using huggingface_hub  6 months ago
generation_config.json         Safe  184 Bytes  -        Upload folder using huggingface_hub  6 months ago
model.safetensors              Safe  3.68 GB    xet      Upload folder using huggingface_hub  6 months ago
quant_log.csv                  -     8.08 kB    -        Upload folder using huggingface_hub  6 months ago
quantize_config.json           Safe  427 Bytes  -        Upload folder using huggingface_hub  6 months ago
requirements-lambda-quant.txt  Safe  1.6 kB     -        Upload folder using huggingface_hub  6 months ago
special_tokens_map.json        Safe  340 Bytes  -        Upload folder using huggingface_hub  6 months ago
tokenizer.json                 Safe  17.2 MB    xet      Upload folder using huggingface_hub  6 months ago
tokenizer_config.json          Safe  54.6 kB    -        Upload folder using huggingface_hub  6 months ago