GLM 4.6V FlashX

Zai

A lightweight, high-speed multimodal model from the GLM-4.6V series with native function calling and thinking mode support. It delivers fast visual understanding at a fraction of the cost of the flagship GLM-4.6V while maintaining strong capabilities across image, video, and document tasks. It is well suited to production multimodal agents that need low latency at low cost.

Capabilities

Thinking

Tool Use

Image Input

PDF Input

Technical Specifications

Context Window: 128,000 tokens

Max Output: 24,000 tokens
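
As a rough illustration of how the two limits above interact, the sketch below caps a requested output length. It assumes that prompt and output tokens share the 128,000-token context window, which the listing does not state; `output_budget` is a hypothetical helper, not part of any SDK.

```python
# Limits copied from the specifications above.
CONTEXT_WINDOW = 128_000  # tokens
MAX_OUTPUT = 24_000       # tokens


def output_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return how many output tokens can be requested for a given prompt size.

    Assumption (not confirmed by the listing): prompt and output tokens
    together must fit inside the context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    # Space left in the window after the prompt; never negative.
    remaining = max(CONTEXT_WINDOW - prompt_tokens, 0)
    # The tightest of the three limits wins.
    return min(requested_output, MAX_OUTPUT, remaining)
```

For example, a 120,000-token prompt leaves room for at most 8,000 output tokens under this assumption, even though the model's output cap is higher.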

Pricing

Token Costs (per 1M tokens)

Cache Miss Input: $0.04

Non-Reasoning Output: $0.40

Cache Read Input: $0.004

Tool Costs (per 1K calls)

Web Search: $15

Code Execution: $0.19
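
To make the pricing arithmetic concrete, here is a minimal cost-estimation sketch using the token and tool prices listed above. The `Usage` container and `estimate_cost` function are hypothetical names chosen for illustration. The listing gives only a non-reasoning output price, so reasoning output is not modeled here.

```python
from dataclasses import dataclass

# Prices in USD, copied from the listing above.
PER_MILLION_TOKENS = {
    "cache_miss_input": 0.04,
    "cache_read_input": 0.004,
    "non_reasoning_output": 0.40,
}
PER_THOUSAND_CALLS = {
    "web_search": 15.00,
    "code_execution": 0.19,
}


@dataclass
class Usage:
    """Hypothetical usage record: token counts and tool-call counts."""
    cache_miss_input: int = 0
    cache_read_input: int = 0
    non_reasoning_output: int = 0
    web_search: int = 0
    code_execution: int = 0


def estimate_cost(usage: Usage) -> float:
    """Estimate the USD cost of the given usage at the listed prices."""
    # Token prices are quoted per 1,000,000 tokens.
    token_cost = sum(
        getattr(usage, name) * price / 1_000_000
        for name, price in PER_MILLION_TOKENS.items()
    )
    # Tool prices are quoted per 1,000 calls.
    tool_cost = sum(
        getattr(usage, name) * price / 1_000
        for name, price in PER_THOUSAND_CALLS.items()
    )
    return token_cost + tool_cost
```

At these rates, a single web search call costs $0.015, which is the same as 375,000 uncached input tokens, so tool calls can dominate the bill for search-heavy agents.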

Legacy

Made legacy on

Reason: Superseded by GLM 5

Recommended Replacement: GLM 5.1