A highly cost-effective 32B foundation model with enhanced capabilities in tool use, online search, and code-related intelligent tasks. Pre-trained on 15T of high-quality data including abundant synthetic reasoning data, it performs comparably to much larger models on many benchmarks. At just $0.1 per million tokens for both input and output, it delivers exceptional value for production workloads requiring tool invocation, information extraction, and code generation.
Try NowNeed cheapest glm model
Budget tool use and search tasks
Lightweight code generation
128,000 tokens
16,000 tokens
$0.10 per 1M tokens
$0.10 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls