Models/meta-llama/Llama-3.2-3B-Instruct
MetaMeta / meta-llama/Llama-3.2-3B-Instruct
Released: 10/25/2024
texttext
Input: $0.02 / Output: $0.02

meta-llama/Llama-3.2-3B-Instruct is an LLM designed for multilingual dialogue, agentic retrieval, and summarization tasks. It excels in supporting up to eight officially recognized languages, offers a long context length of 128,000 tokens, and is optimized for efficient on-device use where privacy and low latency are important.

Some other noteworthy use cases of meta-llama/Llama-3.2-3B-Instruct include tool use (such as extracting action items or sending calendar invites) and customizable fine-tuning for domain-specific applications.

MetricValue
Parameter Count3.21 billion
Mixture of ExpertsNo
Context Length128,000 tokens
MultilingualYes
Quantized*No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Meta models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
texttext$3.00$3.00
texttext$0.90$0.90
texttext$0.90$0.90
texttext$0.59$0.79
text, imagetext$0.22$0.88
texttext$0.05$0.08
texttextN/AN/A
texttext$0.02$0.02
texttext$0.08$0.30
See all models available on Oxen.ai