The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3 Pro model uses more compute to think harder and provide consistently better answers.
Try NowHardest reasoning problems
Tasks requiring maximum intelligence
Problems where consistency matters most
200,000 tokens
100,000 tokens
$20
$80
$15
$0.19