GLM 4.5

Zai

The most powerful GLM reasoning model with 355B total parameters and 32B active per forward pass using Mixture-of-Experts architecture. GLM-4.5 ranks second globally among all models on aggregated benchmarks, first among domestic and open-source models. Purpose-built for agent-oriented applications, it excels at tool invocation, web browsing, software engineering, and front-end development. Supports hybrid reasoning modes for both complex thinking and instant responses.

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Complex reasoning and agent tasks

Need top-tier open-source coding model

Software engineering with agent workflows

Technical Specifications

Context Window

128,000 tokens

Max Output

96,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.60

Non-Reasoning Output

$2.20

Cache Read Input

$0.11

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5.1