A lightweight, high-speed multimodal model from the GLM-4.6V series with native function calling and thinking mode support. Delivers fast visual understanding at a fraction of the cost of the flagship GLM-4.6V while maintaining strong capabilities across image, video, and document tasks. Ideal for production multimodal agents requiring low latency and affordable pricing.
Try Now128,000 tokens
24,000 tokens
$0.04
$0.40
$0.004
$15
$0.19