A small audio understanding model released in July 2025
Audio understanding with tool use
Speech-to-text with instructions
Audio analysis and processing
32,768 tokens
$0.10 per 1M tokens
$0.30 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls
Untested