Llama 3.1 405B

Name: Llama 3.1 405B
Brand: Meta

Example Use Cases

Base model for fine-tuning

Raw text completion without instruction tuning

Largest open-source Llama model

Context Window

32,768 tokens

Max Output

32,768 tokens

Cache Miss Cost

$4 per 1M tokens

Non-Reasoning Cost

$4 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

Untested
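
As a rough illustration of how the listed rates combine, here is a minimal sketch that estimates the cost of a workload. It rests on assumptions the listing does not confirm: that the Cache Miss Cost applies to uncached input tokens and that the Non-Reasoning Cost applies to output tokens. The function name `estimate_cost` and its parameters are hypothetical and do not belong to any provider API.

```python
# Rates copied from the listing above, in USD.
CACHE_MISS_RATE_PER_MILLION = 4.00      # assumed to apply to uncached input tokens
NON_REASONING_RATE_PER_MILLION = 4.00   # assumed to apply to output tokens
WEB_SEARCH_RATE_PER_THOUSAND = 15.00    # per 1K web search calls
CODE_EXEC_RATE_PER_THOUSAND = 0.19      # per 1K code execution calls


def estimate_cost(input_tokens, output_tokens, web_searches=0, code_executions=0):
    """Return an estimated cost in USD for the given usage counts.

    Hypothetical helper: the mapping of rates to input and output
    tokens is an assumption, not something the listing states.
    """
    token_cost = (
        input_tokens / 1_000_000 * CACHE_MISS_RATE_PER_MILLION
        + output_tokens / 1_000_000 * NON_REASONING_RATE_PER_MILLION
    )
    tool_cost = (
        web_searches / 1_000 * WEB_SEARCH_RATE_PER_THOUSAND
        + code_executions / 1_000 * CODE_EXEC_RATE_PER_THOUSAND
    )
    return token_cost + tool_cost


if __name__ == "__main__":
    # Example: 1M input tokens, 500K output tokens, 10 searches, 100 code runs.
    total = estimate_cost(1_000_000, 500_000, web_searches=10, code_executions=100)
    print(f"Estimated cost: ${total:.3f}")
```

Keeping the rates as named constants makes it easy to update the estimate if the listed prices change.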