GLM 4.6V Flash

Zai

A completely free multimodal model with native function calling support from the GLM-4.6V series. Handles image, video, and document understanding at zero cost while supporting tool invocation for building multimodal agents. With 128K context, it provides substantial capability for visual understanding workflows without any API costs.

Try Now

Capabilities

Thinking

Tool Use

Image Input

PDF Input

Technical Specifications

Context Window

128,000 tokens

Max Output

24,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0

Non-Reasoning Output

$0

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5.1