GLM 4.5

zai

The most powerful GLM reasoning model, built on a Mixture-of-Experts architecture with 355B total parameters, of which 32B are active per forward pass. GLM-4.5 ranks second globally among all models on aggregated benchmarks and first among domestic and open-source models. Purpose-built for agent-oriented applications, it excels at tool invocation, web browsing, software engineering, and front-end development. It supports hybrid reasoning modes, covering both extended thinking for complex problems and instant responses.

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Complex reasoning and agent tasks

Coding tasks that need a top-tier open-source model

Software engineering with agent workflows

Technical Specifications

Context Window: 128,000 tokens

Max Output: 96,000 tokens

Cache Miss Cost: $0.60 per 1M tokens

Non-Reasoning Cost: $2.20 per 1M tokens

Cache Read Cost: $0.11 per 1M tokens

Web Search Cost: $15 per 1K calls

Code Execution Cost: $0.19 per 1K calls
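
As a rough illustration of how the rates listed above combine, the following Python sketch estimates the cost of a single workload. The function name `estimate_cost` and its parameters are hypothetical and not part of any provider API. It also rests on two assumptions: that "Cache Miss Cost" applies to input tokens not served from cache, and that "Non-Reasoning Cost" applies to generated output tokens. Neither is confirmed on this page, so check the provider's billing documentation before relying on the numbers.

```python
# Rates copied from the specifications above (USD).
CACHE_MISS_PER_1M = 0.60      # "Cache Miss Cost", per 1M tokens
NON_REASONING_PER_1M = 2.20   # "Non-Reasoning Cost", per 1M tokens
CACHE_READ_PER_1M = 0.11      # "Cache Read Cost", per 1M tokens
WEB_SEARCH_PER_1K = 15.00     # "Web Search Cost", per 1K calls
CODE_EXEC_PER_1K = 0.19       # "Code Execution Cost", per 1K calls


def estimate_cost(
    uncached_input_tokens: int = 0,
    cached_input_tokens: int = 0,
    output_tokens: int = 0,
    web_search_calls: int = 0,
    code_execution_calls: int = 0,
) -> float:
    """Hypothetical helper: estimated cost in USD for one workload.

    Assumes cache-miss pricing covers uncached input tokens and
    non-reasoning pricing covers output tokens (an assumption).
    """
    token_cost = (
        uncached_input_tokens * CACHE_MISS_PER_1M
        + cached_input_tokens * CACHE_READ_PER_1M
        + output_tokens * NON_REASONING_PER_1M
    ) / 1_000_000
    call_cost = (
        web_search_calls * WEB_SEARCH_PER_1K
        + code_execution_calls * CODE_EXEC_PER_1K
    ) / 1_000
    return token_cost + call_cost


if __name__ == "__main__":
    cost = estimate_cost(
        uncached_input_tokens=200_000,
        cached_input_tokens=800_000,
        output_tokens=50_000,
        web_search_calls=10,
    )
    print(f"${cost:.3f}")  # prints $0.468
```

In this example most of the input is served from cache, so the ten web search calls ($0.15) cost more than the 200,000 uncached input tokens ($0.12).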

⚠️ Legacy

Made legacy on

Reason: Superseded by GLM 5

Recommended Replacement: GLM 5