GLM 5

zai

Zai's new-generation flagship foundation model designed for Agentic Engineering. GLM-5 delivers state-of-the-art open-source performance in coding and agent capabilities, with usability in real programming scenarios approaching Claude Opus 4.5. Built for complex system engineering and long-range agent tasks, it provides reliable productivity across demanding workflows. With 203K context, 131K output, thinking capabilities, and implicit caching, it excels at sophisticated agentic applications requiring depth and persistence.

Try Now

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Complex system engineering

Long-range agentic workflows

Real-world coding approaching top-tier quality

Technical Specifications

Context Window

202,800 tokens

Max Output

131,072 tokens

Cache Miss Cost

$1 per 1M tokens

Non-Reasoning Cost

$3.20 per 1M tokens

Cache Read Cost

$0.20 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls