The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1 Pro model uses more compute to think harder and provide consistently better answers.
Try NowHardest reasoning problems
Tasks requiring deep thinking
Correctness-critical complex analysis
200,000 tokens
100,000 tokens
$150
$600
$15
$0.19