Llama 3.1 70B Instruct

meta

Meta's Llama 3.1 70B instruct-tuned version is optimized for high quality dialogue usecases with 128K context. It has demonstrated strong performance compared to leading closed-source models in human evaluations. Usage of this model is subject to Meta's Acceptable Use Policy.

Try Now

Capabilities

Tool Use

Example Use Cases

General-purpose long-context task

Open-source 70B with tool calling

Balanced cost-performance reasoning

Technical Specifications

Context Window

131,072 tokens

Max Output

131,072 tokens

Cache Miss Cost

$0.40 per 1M tokens

Non-Reasoning Cost

$0.40 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus