The premium high-speed variant of GLM-4.5, delivering the full reasoning power of the flagship 355B MoE model with ultra-fast inference. Optimized for scenarios requiring both strong reasoning capabilities and rapid response times, it provides the best of both worlds for demanding production workloads. Ideal for interactive agent applications and real-time coding assistance where latency is critical.
Try NowNeed fastest glm reasoning
Premium speed for complex agent tasks
Ultra-fast response with strong reasoning
128,000 tokens
96,000 tokens
$2.20 per 1M tokens
$8.90 per 1M tokens
$0.45 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls