GLM 4 32B

zai

A highly cost-effective 32B foundation model with enhanced capabilities in tool use, online search, and code-related intelligent tasks. Pre-trained on 15T of high-quality data including abundant synthetic reasoning data, it performs comparably to much larger models on many benchmarks. At just $0.1 per million tokens for both input and output, it delivers exceptional value for production workloads requiring tool invocation, information extraction, and code generation.

Try Now

Capabilities

Tool Use

Example Use Cases

Need cheapest glm model

Budget tool use and search tasks

Lightweight code generation

Technical Specifications

Context Window

128,000 tokens

Max Output

16,000 tokens

Cache Miss Cost

$0.10 per 1M tokens

Non-Reasoning Cost

$0.10 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Superseded by GLM 5

Recommended Replacement

GLM 5