A cost-efficient version of GPT Audio. It features an upgraded decoder for more natural sounding voices and maintains better voice consistency.
128,000 tokens
16,384 tokens
$0.60
$2.40
Audio model; not applicable for text chat