Llama 3.1 405B Instruct

Meta

Meta's Llama 3.1 405B instruct-tuned model is optimized for high-quality dialogue use cases and supports a 128K-token context. In evaluations, it has demonstrated strong performance compared with leading closed-source models, including GPT-4o and Claude 3.5 Sonnet. Usage of this model is subject to Meta's Acceptable Use Policy.

Capabilities

Tool Use

Example Use Cases

Complex tasks that need the largest open-source model

Long-context analysis with tool use

Open-source alternative to GPT-4o

Technical Specifications

Context Window

131,000 tokens

Max Output

131,000 tokens

Cache Miss Cost

$4 per 1M tokens

Non-Reasoning Cost

$4 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls
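
The listed rates can be combined into a rough per-request cost estimate. The sketch below is illustrative only: it assumes that "Cache Miss Cost" applies to uncached input tokens and that "Non-Reasoning Cost" applies to output tokens, which this page does not state explicitly. The function name `estimate_cost` and its parameters are hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Rates copied from the specifications above (USD).
CACHE_MISS_PER_1M_TOKENS = Decimal("4")       # assumed: uncached input tokens
NON_REASONING_PER_1M_TOKENS = Decimal("4")    # assumed: output tokens
WEB_SEARCH_PER_1K_CALLS = Decimal("15")
CODE_EXECUTION_PER_1K_CALLS = Decimal("0.19")


def estimate_cost(input_tokens, output_tokens, web_searches=0, code_runs=0):
    """Return an estimated request cost in USD as a Decimal.

    Hypothetical helper: the mapping of rates to input and output
    tokens is an assumption, not something the listing confirms.
    """
    token_cost = (
        Decimal(input_tokens) * CACHE_MISS_PER_1M_TOKENS
        + Decimal(output_tokens) * NON_REASONING_PER_1M_TOKENS
    ) / Decimal(1_000_000)
    tool_cost = (
        Decimal(web_searches) * WEB_SEARCH_PER_1K_CALLS
        + Decimal(code_runs) * CODE_EXECUTION_PER_1K_CALLS
    ) / Decimal(1_000)
    return token_cost + tool_cost


# Example: a 100,000-token prompt, a 2,000-token reply,
# three web searches, and ten code executions.
example_cost = estimate_cost(100_000, 2_000, web_searches=3, code_runs=10)
```

Under these assumptions the example request comes to roughly $0.45, with almost all of it driven by the input tokens. `Decimal` is used instead of floats so that small per-call rates such as $0.19 per 1K calls do not pick up rounding error.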

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus