Llama 3.2 1B Instruct

Meta

Llama 3.2 1B is a 1-billion-parameter language model for natural language tasks such as summarization, dialogue, and multilingual text analysis. Its small size lets it run in low-resource environments while maintaining strong task performance. It supports eight core languages and can be fine-tuned for more, which suits businesses and developers who need a lightweight model for multilingual settings without the computational demands of larger models. Use of this model is subject to Meta's Acceptable Use Policy.

Example Use Cases

Ultra-lightweight text tasks

Low-resource or edge deployment

Simple multilingual summarization

Technical Specifications

Context Window: 60,000 tokens

Max Output: 60,000 tokens

Cache Miss Cost: $0.027 per 1M tokens

Non-Reasoning Cost: $0.20 per 1M tokens

Web Search Cost: $15 per 1K calls

Code Execution Cost: $0.19 per 1K calls
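
The per-unit prices above can be turned into a rough cost estimate. The following is a minimal sketch, not an official billing formula: it assumes the cache miss cost applies to input tokens that are not served from a cache and the non-reasoning cost applies to output tokens, which this page does not state. The function name `estimate_cost` and its parameters are hypothetical.

```python
# Prices copied from the Technical Specifications (USD).
CACHE_MISS_PER_1M_TOKENS = 0.027      # assumed here to bill uncached input tokens
NON_REASONING_PER_1M_TOKENS = 0.20    # assumed here to bill output tokens
WEB_SEARCH_PER_1K_CALLS = 15.0
CODE_EXECUTION_PER_1K_CALLS = 0.19


def estimate_cost(input_tokens, output_tokens,
                  web_search_calls=0, code_execution_calls=0):
    """Return an estimated cost in USD under the assumptions stated above."""
    counts = (input_tokens, output_tokens, web_search_calls, code_execution_calls)
    if any(count < 0 for count in counts):
        raise ValueError("token and call counts must be non-negative")

    # Token prices are quoted per 1M tokens, tool prices per 1K calls.
    token_cost = (
        input_tokens / 1_000_000 * CACHE_MISS_PER_1M_TOKENS
        + output_tokens / 1_000_000 * NON_REASONING_PER_1M_TOKENS
    )
    tool_cost = (
        web_search_calls / 1_000 * WEB_SEARCH_PER_1K_CALLS
        + code_execution_calls / 1_000 * CODE_EXECUTION_PER_1K_CALLS
    )
    return token_cost + tool_cost


if __name__ == "__main__":
    # Example workload: 2M input tokens, 500K output tokens, 100 web searches.
    print(f"${estimate_cost(2_000_000, 500_000, web_search_calls=100):.4f}")
```

At these rates the tool prices can dominate: a single web search costs $0.015, about as much as 75,000 output tokens at the non-reasoning rate.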

⚠️ Legacy

Made legacy on

Reason: Untested

Recommended Replacement: Qwen3.5 Plus