xAI's legacy vision model with a 32K context window. It accepts text and image inputs and supports function calling and structured outputs. Superseded by Grok 4.
32,768 tokens
$2
$10
$0
$15
$0.19
$70
Outdated model