GLM 4.7 FlashX

zai

A lightweight, high-speed variant of GLM-4.7 that offers enhanced general capabilities and optimized agentic coding at a fraction of the cost. With a 200K-token context window, thinking support, and rapid inference, it balances strong programming ability with affordability. It is well suited to high-throughput development workflows and agent systems where speed and cost-efficiency matter.

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Projects that need a fast, affordable coding model

Lightweight agentic coding tasks

High-speed development at low cost

Technical Specifications

Context Window

200,000 tokens

Max Output

128,000 tokens

Cache Miss Cost

$0.07 per 1M tokens

Non-Reasoning Cost

$0.40 per 1M tokens

Cache Read Cost

$0.01 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls
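
The rates above can be combined into a rough cost estimate. The following is a minimal sketch, not an official billing formula. It assumes that the Cache Miss rate applies to uncached input tokens, the Cache Read rate to input tokens served from cache, and the Non-Reasoning rate to output tokens; the page does not state this mapping explicitly. The function name `estimate_cost` and its parameters are hypothetical and chosen only for illustration.

```python
# Rates copied from the specification list above, in USD.
CACHE_MISS_PER_1M = 0.07      # assumed: uncached input tokens
CACHE_READ_PER_1M = 0.01      # assumed: input tokens read from cache
NON_REASONING_PER_1M = 0.40   # assumed: output tokens
WEB_SEARCH_PER_1K = 15.00     # per 1K web search calls
CODE_EXEC_PER_1K = 0.19       # per 1K code execution calls


def estimate_cost(uncached_input_tokens: int,
                  cached_input_tokens: int,
                  output_tokens: int,
                  web_searches: int = 0,
                  code_executions: int = 0) -> float:
    """Return an estimated cost in USD under the assumptions stated above."""
    # Token rates are quoted per 1M tokens.
    token_cost = (
        uncached_input_tokens * CACHE_MISS_PER_1M
        + cached_input_tokens * CACHE_READ_PER_1M
        + output_tokens * NON_REASONING_PER_1M
    ) / 1_000_000
    # Tool rates are quoted per 1K calls.
    call_cost = (
        web_searches * WEB_SEARCH_PER_1K
        + code_executions * CODE_EXEC_PER_1K
    ) / 1_000
    return token_cost + call_cost
```

For example, `estimate_cost(2_000_000, 8_000_000, 1_000_000, web_searches=100)` sums $0.14 for uncached input, $0.08 for cached input, $0.40 for output, and $1.50 for web searches, for roughly $2.12 in total. Under these rates, tool calls can dominate the bill well before token usage does.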

⚠️ Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5