	# Llama3.2-3B
	Run Llama3.2-3B optimized for Qualcomm NPUs with [nexaSDK](https://sdk.nexa.ai).

	## Quickstart

	1. Install nexaSDK and create a free account at [sdk.nexa.ai](https://sdk.nexa.ai)
	2. Activate your device with your access token:
	```bash
	nexa config set license '<access_token>'
	```
	3. Run the model on a Qualcomm NPU with a single command:
	```bash
	nexa infer NexaAI/Llama3.2-3B-NPU-Turbo
	```
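The two commands above can also be wrapped in a small script. This is a minimal sketch, not an official nexaSDK workflow: the function name `run_quickstart` and the environment variables `NEXA_ACCESS_TOKEN` and `DRY_RUN` are hypothetical names chosen for this example, and only the two `nexa` commands shown in the Quickstart are used.

```bash
#!/usr/bin/env bash
# Sketch: wrap the two Quickstart commands in one reusable function.
# NEXA_ACCESS_TOKEN and DRY_RUN are hypothetical variable names used only by
# this script; nexaSDK itself does not read them.

MODEL="NexaAI/Llama3.2-3B-NPU-Turbo"

run_quickstart() {
  # Refuse to continue without a token instead of storing an empty license.
  if [ -z "${NEXA_ACCESS_TOKEN:-}" ]; then
    echo "error: set NEXA_ACCESS_TOKEN to your access token" >&2
    return 1
  fi

  # Dry run: print the commands (token redacted) without executing them.
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "nexa config set license '<access_token>'"
    echo "nexa infer $MODEL"
    return 0
  fi

  # A real run needs the nexa CLI installed and on PATH.
  if ! command -v nexa >/dev/null 2>&1; then
    echo "error: nexa not found; install nexaSDK first" >&2
    return 1
  fi

  # Step 2 of the Quickstart: activate the device.
  nexa config set license "$NEXA_ACCESS_TOKEN" || return 1
  # Step 3 of the Quickstart: run the model on the NPU.
  nexa infer "$MODEL"
}
```

After sourcing the script, `DRY_RUN=1 NEXA_ACCESS_TOKEN=... run_quickstart` previews the commands, and dropping `DRY_RUN` executes them. Reading the token from the environment keeps it out of the script file.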

	## Model Description
	Llama3.2-3B is a 3-billion-parameter language model from Meta’s Llama 3.2 series.
	It balances efficiency and capability: it is small enough to deploy on a wide range of devices while retaining strong performance on core language understanding and generation tasks.

	Trained on diverse, high-quality datasets, Llama3.2-3B supports multiple languages and can be fine-tuned for real-world applications.

	## Features
	- Lightweight yet capable: delivers strong performance with a smaller memory footprint than larger Llama models.
	- Conversational AI: context-aware dialogue for assistants and agents.
	- Content generation: text completion, summarization, code comments, and more.
	- Reasoning & analysis: step-by-step problem solving and explanation.
	- Multilingual: supports understanding and generation in multiple languages.
	- Customizable: can be fine-tuned for domain-specific or enterprise use.

	## Use Cases
	- Personal and enterprise chatbots
	- On-device AI applications
	- Document and report summarization
	- Education and tutoring tools
	- Specialized models in verticals (e.g., healthcare, finance, legal)

	## Inputs and Outputs
	Input:
	- Text prompts or conversation history (tokenized input sequences).

	Output:
	- Generated text: responses, explanations, or creative content.
	- Optionally: raw logits/probabilities for advanced downstream tasks.
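For readers unfamiliar with the distinction between logits and probabilities, the conversion is a softmax over the vocabulary scores. The sketch below is illustrative only: `softmax` is a hypothetical helper, the input values are made up, and nothing here shows how (or whether) nexaSDK exposes logits.

```bash
# Sketch: convert a whitespace-separated list of logits into probabilities.
# softmax is a hypothetical helper name, not part of nexaSDK.
softmax() {
  printf '%s\n' "$*" | awk '{
    # Subtract the largest logit before exponentiating for numerical stability.
    max = $1 + 0
    for (i = 2; i <= NF; i++) if ($i + 0 > max) max = $i + 0
    sum = 0
    for (i = 1; i <= NF; i++) { e[i] = exp($i - max); sum += e[i] }
    # Normalize so the outputs sum to 1.
    for (i = 1; i <= NF; i++) printf "%.4f%s", e[i] / sum, (i < NF ? " " : "\n")
  }'
}

softmax 1 2 3   # prints: 0.0900 0.2447 0.6652
```

The highest logit always maps to the highest probability, so sampling or ranking can work from either representation.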

	## License
	- Licensed under the [Llama 3.2 Community License](https://huggingface.co/meta-llama/Llama-3.2-3B/blob/main/LICENSE)

	## References
	- [Meta AI – Llama Models](https://ai.meta.com/research/publications/llama/)
	- [Hugging Face Model Card](https://huggingface.co/meta-llama/Llama-3.2-3B)