GPT-4o Audio

OpenAI

The GPT-4o Audio model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences.

Capabilities

Tool Use

Technical Specifications

Context Window

128,000 tokens

Max Output

16,384 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$2.50

Non-Reasoning Output

$10

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Retired

Made legacy on March 6, 2026

Reason

Audio model; not applicable for text chat

Recommended Replacement

Retired on March 6, 2026