GLM 4.6V

zai

GLM 4.6V is a multimodal model with state-of-the-art visual understanding among models of similar scale. It pairs image analysis with the reasoning and tool-use capabilities of the GLM family, and its 128K-token context window supports image understanding, document analysis, and visual reasoning tasks. It suits multimodal workflows that need reliable visual comprehension without premium pricing.

Capabilities

Tool Use

Extended Thinking

Image Input

File Input

Technical Specifications

Context Window: 128,000 tokens

Max Output: 24,000 tokens

Input Cost: $0.30 per 1M tokens

Output Cost: $0.90 per 1M tokens

File Input Limit: 10 files

Files in Assistant Messages: Yes
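
To make the figures above concrete, here is a minimal Python sketch that estimates the cost of one request and checks it against the listed limits. The function names `estimate_cost_usd` and `fits_limits` are hypothetical helpers, not part of any official SDK. The sketch also assumes that input and output tokens share the 128,000-token context window and that image and file inputs are billed as ordinary input tokens; neither point is stated on this page.

```python
# Figures copied from the Technical Specifications listed above.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 24_000
INPUT_USD_PER_MILLION = 0.30
OUTPUT_USD_PER_MILLION = 0.90
MAX_FILES = 10


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of a single request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


def fits_limits(input_tokens: int, output_tokens: int, file_count: int = 0) -> bool:
    """Check a request against the listed limits (hypothetical helper).

    Assumption: input and output tokens count toward the same context window.
    """
    return (
        output_tokens <= MAX_OUTPUT_TOKENS
        and input_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS
        and file_count <= MAX_FILES
    )


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply and 3 files.
    print(f"Estimated cost: ${estimate_cost_usd(100_000, 2_000):.4f}")
    print("Within limits:", fits_limits(100_000, 2_000, file_count=3))
```

At these rates, a request with 100,000 input tokens and 2,000 output tokens comes to roughly three cents, with the input side accounting for most of it.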

Availability

No account required: $0 per month

Also available in these plans:

Free

Plus: $20/mo

Pro: $100/mo

Super: $200/mo