GPT-4o Audio

OpenAI

The GPT-4o Audio model accepts audio as prompt input. This lets the model pick up nuances in audio recordings and use them to produce richer responses.

Capabilities

Tool Use

Example Use Cases

Audio input processing

Voice-aware prompting

Audio nuance detection

Technical Specifications

Context Window: 128,000 tokens

Max Output: 16,384 tokens
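The two limits above can be combined into a rough prompt budget. The sketch below assumes that prompt and output tokens share the same context window, which this page does not state; `max_prompt_tokens` is a hypothetical helper, not part of any SDK.

```python
# Limits copied from the specifications above.
CONTEXT_WINDOW = 128_000  # total tokens
MAX_OUTPUT = 16_384       # maximum output tokens


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Hypothetical helper: tokens left for the prompt after reserving output.

    Assumes prompt and output count against the same context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output
```

Under that assumption, reserving the full output limit leaves 111,616 tokens for the prompt.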

Pricing

Token Costs (per 1M tokens)

Cache Miss Input: $2.50

Non-Reasoning Output: $10

Tool Costs (per 1K calls)

Web Search: $15

Code Execution: $0.19
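As a rough illustration of how the rates above combine, the sketch below estimates the cost of a single request. It assumes billing is strictly linear in tokens and tool calls, with no cached-input discount or minimum charge; `estimate_cost` is a hypothetical helper, not part of any SDK.

```python
from decimal import Decimal

# Rates copied from the pricing listed above (USD).
INPUT_PER_1M = Decimal("2.50")      # cache-miss input, per 1M tokens
OUTPUT_PER_1M = Decimal("10")       # non-reasoning output, per 1M tokens
WEB_SEARCH_PER_1K = Decimal("15")   # web search, per 1K calls
CODE_EXEC_PER_1K = Decimal("0.19")  # code execution, per 1K calls


def estimate_cost(input_tokens, output_tokens, web_searches=0, code_executions=0):
    """Hypothetical helper: estimated USD cost, assuming linear billing."""
    token_cost = (
        Decimal(input_tokens) * INPUT_PER_1M
        + Decimal(output_tokens) * OUTPUT_PER_1M
    ) / Decimal(1_000_000)
    tool_cost = (
        Decimal(web_searches) * WEB_SEARCH_PER_1K
        + Decimal(code_executions) * CODE_EXEC_PER_1K
    ) / Decimal(1_000)
    return token_cost + tool_cost
```

Under these assumptions, a request with 10,000 input tokens, 2,000 output tokens, and one web search call comes to $0.06.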

⚠️ Legacy

Made legacy on

Reason: We don't support audio models yet

Recommended Replacement: Qwen3.5 Plus