GLM 4.5 X

Zai

The premium high-speed variant of GLM-4.5, delivering the full reasoning power of the flagship 355B MoE model with ultra-fast inference. Optimized for scenarios requiring both strong reasoning capabilities and rapid response times, it provides the best of both worlds for demanding production workloads. Ideal for interactive agent applications and real-time coding assistance where latency is critical.

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Need fastest glm reasoning

Premium speed for complex agent tasks

Ultra-fast response with strong reasoning

Technical Specifications

Context Window

128,000 tokens

Max Output

96,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$2.20

Non-Reasoning Output

$8.90

Cache Read Input

$0.45

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5.1