GPT Audio

openai

The GPT Audio model is OpenAI's first generally available audio model. It features an upgraded decoder for more natural sounding voices and maintains better voice consistency.

Try Now

Example Use Cases

Audio generation tasks

Voice synthesis

Natural speech output

Technical Specifications

Context Window

128,000 tokens

Max Output

16,384 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$2.50

Non-Reasoning Output

$10

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

⚠️ Legacy

Made legacy on

Reason

We don't support audio models yet

Recommended Replacement

Qwen3.5 Plus