GLM 4.5 Flash

zai

A completely free GLM model with strong reasoning, coding, and agent capabilities. Despite being free, it delivers impressive performance suitable for a wide range of tasks including development workflows, agent applications, and general reasoning. With 200K context and thinking support, it provides substantial capability at zero cost — perfect for experimentation, prototyping, and budget-sensitive production use.

Try Now

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Need free reasoning model

Budget-constrained agent and coding tasks

Zero-cost glm with thinking capabilities

Technical Specifications

Context Window

200,000 tokens

Max Output

96,000 tokens

Cache Miss Cost

$0 per 1M tokens

Non-Reasoning Cost

$0 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5