Llama 3.1 70B Instruct

Meta

Meta's Llama 3.1 70B instruct-tuned version is optimized for high quality dialogue usecases with 128K context. It has demonstrated strong performance compared to leading closed-source models in human evaluations. Usage of this model is subject to Meta's Acceptable Use Policy.

Try Now

Capabilities

Tool Use

Example Use Cases

General-purpose long-context task

Open-source 70B with tool calling

Balanced cost-performance reasoning

Technical Specifications

Context Window

131,072 tokens

Max Output

131,072 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.40

Non-Reasoning Output

$0.40

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.6 Plus