GPT-4o Audio

OpenAI

The GPT-4o Audio model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences.

Try Now

Capabilities

Tool Use

Technical Specifications

Context Window

128,000 tokens

Max Output

16,384 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$2.50

Non-Reasoning Output

$10

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Retired

Made legacy on

Reason

Audio model; not applicable for text chat

Recommended Replacement

GPT-5.5

Retired on