GLM 4.6

Zai

The latest and most capable GLM model with comprehensive improvements across all domains. This versatile model excels at real-world coding, handles long contexts up to 200K tokens, and delivers strong performance in reasoning, research, writing, and agentic workflows. With thinking capabilities and an impressive 96K output window, it tackles diverse professional tasks with confidence. The well-rounded upgrade brings enhanced capabilities across the board while maintaining excellent value. Choose this for sophisticated work requiring versatility and depth.

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Real-world coding project

Want glm for versatile professional work

Coding with long context and large output

Technical Specifications

Context Window

200,000 tokens

Max Output

96,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.45

Non-Reasoning Output

$1.80

Cache Read Input

$0.11

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Superseded by GLM 4.7

Recommended Replacement

GLM 5.1