Meta / meta-llama/Llama-3.2-3B-Instruct
Released: 10/25/2024texttext
Input: $0.02 / Output: $0.02
meta-llama/Llama-3.2-3B-Instruct is an LLM designed for multilingual dialogue, agentic retrieval, and summarization tasks. It excels in supporting up to eight officially recognized languages, offers a long context length of 128,000 tokens, and is optimized for efficient on-device use where privacy and low latency are important.
Some other noteworthy use cases of meta-llama/Llama-3.2-3B-Instruct include tool use (such as extracting action items or sending calendar invites) and customizable fine-tuning for domain-specific applications.
| Metric | Value |
|---|---|
| Parameter Count | 3.21 billion |
| Mixture of Experts | No |
| Context Length | 128,000 tokens |
| Multilingual | Yes |
| Quantized* | No |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Meta models available on Oxen.ai
| Modality | Price (1M tokens) | ||||
|---|---|---|---|---|---|
| Model | Input | Output | Input | Output | |
| text | text | $3.00 | $3.00 | ||
| text | text | $0.90 | $0.90 | ||
| text | text | $0.90 | $0.90 | ||
| text | text | $0.59 | $0.79 | ||
| text, image | text | $0.22 | $0.88 | ||
| text | text | $0.05 | $0.08 | ||
| text | text | N/A | N/A | ||
| text | text | $0.02 | $0.02 | ||
| text | text | $0.08 | $0.30 | ||