GLM 4.6V FlashX

zai

A lightweight, high-speed multimodal model from the GLM-4.6V series with native function calling and thinking mode support. Delivers fast visual understanding at a fraction of the cost of the flagship GLM-4.6V while maintaining strong capabilities across image, video, and document tasks. Ideal for production multimodal agents requiring low latency and affordable pricing.

Try Now

Capabilities

Tool Use

Extended Thinking

Image Input

PDF Input

Example Use Cases

Need fast affordable vision model

Lightweight multimodal agent tasks

High-speed image understanding with tools

Technical Specifications

Context Window

128,000 tokens

Max Output

24,000 tokens

Cache Miss Cost

$0.04 per 1M tokens

Non-Reasoning Cost

$0.40 per 1M tokens

Cache Read Cost

$0.004 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5