The GPT-4o Audio model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences.
Try NowAudio input processing
Voice-aware prompting
Audio nuance detection
128,000 tokens
16,384 tokens
$2.50
$10
$15
$0.19