NexaAI
/

Llama3.2-3B-NPU-Turbo

Text Generation

Model card Files Files and versions

alanzhuly commited on Sep 3

Commit

b4a405b

·

verified ·

1 Parent(s): 98cd655

Create README.md

Files changed (1) hide show

README.md +50 -0

README.md ADDED Viewed

	@@ -0,0 +1,50 @@

+# Llama3.2-3B
+Run **Llama3.2-3B** optimized for **Qualcomm NPUs** with [nexaSDK](https://sdk.nexa.ai).
+## Quickstart
+1. **Install nexaSDK** and create a free account at [sdk.nexa.ai](https://sdk.nexa.ai)
+2. **Activate your device** with your access token:
+   ```bash
+   nexa config set license '<access_token>'
+   ```
+3. Run the model on Qualcomm NPU in one line:
+   ```bash
+   nexa infer NexaAI/Llama3.2-3B-NPU-Turbo
+   ```
+## Model Description
+**Llama3.2-3B** is a 3-billion-parameter language model from Meta’s Llama 3.2 series.
+It is designed to provide a balance of efficiency and capability, making it suitable for deployment on a wide range of devices while maintaining strong performance on core language understanding and generation tasks.
+Trained on diverse, high-quality datasets, Llama3.2-3B supports multiple languages and is optimized for scalability, fine-tuning, and real-world applications.
+## Features
+- **Lightweight yet capable**: delivers strong performance with a smaller memory footprint.
+- **Conversational AI**: context-aware dialogue for assistants and agents.
+- **Content generation**: text completion, summarization, code comments, and more.
+- **Reasoning & analysis**: step-by-step problem solving and explanation.
+- **Multilingual**: supports understanding and generation in multiple languages.
+- **Customizable**: can be fine-tuned for domain-specific or enterprise use.
+## Use Cases
+- Personal and enterprise chatbots
+- On-device AI applications
+- Document and report summarization
+- Education and tutoring tools
+- Specialized models in verticals (e.g., healthcare, finance, legal)
+## Inputs and Outputs
+**Input**:
+- Text prompts or conversation history (tokenized input sequences).
+**Output**:
+- Generated text: responses, explanations, or creative content.
+- Optionally: raw logits/probabilities for advanced downstream tasks.
+## License
+- Licensed under: [Meta Llama 3.2 License](https://huggingface.co/meta-llama/Llama-3.2-3B/blob/main/LICENSE)
+## References
+- [Meta AI – Llama Models](https://ai.meta.com/research/publications/llama/)
+- [Hugging Face Model Card](https://huggingface.co/meta-llama/Llama-3.2-3B)