xAI's legacy vision model with 32K context supporting text and image inputs with function calling and structured outputs. Superseded by Grok 4.
Try NowBasic image understanding
Legacy xai vision tasks
32,768 tokens
32,768 tokens
$2 per 1M tokens
$10 per 1M tokens
$0 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls
$20 per 1K calls