GLM 4.5 X

zai

The premium high-speed variant of GLM-4.5, delivering the full reasoning power of the flagship 355B MoE model with ultra-fast inference. Optimized for scenarios requiring both strong reasoning capabilities and rapid response times, it provides the best of both worlds for demanding production workloads. Ideal for interactive agent applications and real-time coding assistance where latency is critical.

Try Now

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Need fastest glm reasoning

Premium speed for complex agent tasks

Ultra-fast response with strong reasoning

Technical Specifications

Context Window

128,000 tokens

Max Output

96,000 tokens

Cache Miss Cost

$2.20 per 1M tokens

Non-Reasoning Cost

$8.90 per 1M tokens

Cache Read Cost

$0.45 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5