GLM 4.7 FlashX

Zai

A lightweight, high-speed variant of GLM-4.7 delivering enhanced general capabilities and optimized agentic coding at a fraction of the cost. With 200K context, thinking support, and rapid inference, it balances strong programming ability with affordability. Ideal for high-throughput development workflows and agent systems where speed and cost-efficiency matter.

Try Now

Capabilities

Thinking

Tool Use

Technical Specifications

Context Window

200,000 tokens

Max Output

128,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.07

Non-Reasoning Output

$0.40

Cache Read Input

$0.01

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5.1