GLM 4.6V

Zai

GLM 4.6V is a multimodal model that achieves state-of-the-art visual understanding among models of similar scale. It combines image analysis with the reasoning and tool-use capabilities of the GLM family. With a 128K-token context window and support for image and PDF input, it handles image understanding, document analysis, and visual reasoning tasks. It is a good fit for multimodal workflows that need reliable visual comprehension without premium pricing.

Capabilities

Thinking

Tool Use

Image Input

PDF Input

Technical Specifications

Context Window

128,000 tokens

Max Output

24,000 tokens
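
To illustrate how the two limits above interact, here is a minimal sketch of a pre-flight check. It assumes that prompt tokens and generated tokens share the 128,000-token context window, which this page does not state. The function name `fits_limits` is hypothetical, and token counts are taken as given rather than computed by any particular tokenizer.

```python
# Limits copied from the Technical Specifications on this page.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 24_000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumption (not stated on this page): prompt and output tokens
    count against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS
```

Under that assumption, a 100,000-token prompt leaves room for the full 24,000-token output, while a 110,000-token prompt does not.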

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.30

Non-Reasoning Output

$0.90

Cache Read Input

$0.05

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19
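
The rates above can be combined into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost_usd` and its parameters are hypothetical, it applies the listed output rate to non-reasoning output only (no separate reasoning-output rate appears on this page), and it assumes tool calls are billed linearly per call.

```python
# Rates copied from the Pricing section on this page (USD).
CACHE_MISS_INPUT_PER_M = 0.30   # per 1M uncached input tokens
CACHE_READ_INPUT_PER_M = 0.05   # per 1M cached input tokens
OUTPUT_PER_M = 0.90             # per 1M non-reasoning output tokens
WEB_SEARCH_PER_K = 15.00        # per 1K web search calls
CODE_EXECUTION_PER_K = 0.19     # per 1K code execution calls


def estimate_cost_usd(
    uncached_input_tokens: int,
    cached_input_tokens: int,
    output_tokens: int,
    web_searches: int = 0,
    code_executions: int = 0,
) -> float:
    """Estimate the cost of one request from the listed rates."""
    token_cost = (
        uncached_input_tokens * CACHE_MISS_INPUT_PER_M
        + cached_input_tokens * CACHE_READ_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    ) / 1_000_000
    # Assumption: tool usage is billed linearly per call.
    tool_cost = (
        web_searches * WEB_SEARCH_PER_K
        + code_executions * CODE_EXECUTION_PER_K
    ) / 1_000
    return token_cost + tool_cost


if __name__ == "__main__":
    cost = estimate_cost_usd(100_000, 20_000, 4_000, web_searches=2)
    print(f"Estimated cost: ${cost:.4f}")
```

For example, 100,000 uncached input tokens, 20,000 cached input tokens, 4,000 output tokens, and two web searches come to roughly $0.065, of which $0.03 is the web searches.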

Legacy

Made legacy on

Reason

Outdated model

Recommended Replacement

GLM 5V Turbo