Grok 2 Vision

Name: Grok 2 Vision
Brand: xAI

xAI

xAI's legacy vision model with 32K context supporting text and image inputs with function calling and structured outputs. Superseded by Grok 4.

Capabilities

Context Window

32,768 tokens

Max Output

32,768 tokens

Cache Miss Input

Non-Reasoning Output

$10

Cache Read Input

Web Search

$15

Code Execution

$0.19

Image Generation

$70

Outdated model