Our premier vision-language model combines 235B parameters with exceptional visual understanding and reasoning. It excels at analyzing images, charts, diagrams, and documents, with a special focus on STEM and mathematical content. With dramatically improved OCR and visual perception, it handles everything from handwritten equations to complex technical diagrams. It is well suited to research, education, and any task requiring sophisticated visual analysis.
Analyzing math equations or scientific diagrams
Extracting text from images (OCR)
STEM-focused image analysis
131,072 tokens
32,768 tokens
$0.70 per 1M tokens
$8.40 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls
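
As a rough illustration of the per-token arithmetic, the sketch below estimates the cost of a single request. The listing does not say which per-token price applies to input and which to output, so the mapping used here ($0.70 per 1M input tokens, $8.40 per 1M output tokens) is an assumption, and the function and constant names are hypothetical. The per-call prices are left out because the listing does not say what they cover.

```python
from decimal import Decimal

# Prices copied from the listing. ASSUMPTION: the lower price is the input
# rate and the higher price is the output rate; the page does not label them.
ASSUMED_INPUT_USD_PER_1M = Decimal("0.70")
ASSUMED_OUTPUT_USD_PER_1M = Decimal("8.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_token_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request under the assumed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each per-million price by the number of tokens actually used.
    input_cost = ASSUMED_INPUT_USD_PER_1M * input_tokens / TOKENS_PER_MILLION
    output_cost = ASSUMED_OUTPUT_USD_PER_1M * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(estimate_token_cost(100_000, 2_000))
```

Under these assumed rates, a request with 100,000 input tokens and 2,000 output tokens comes to about $0.0868. Decimal is used instead of float so that small per-request amounts add up without binary rounding drift.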