A capable multimodal model achieving state-of-the-art visual understanding among models of similar scale. GLM 4.6V combines strong image analysis with the reasoning and tool use capabilities of the GLM family. With 128K context support and vision capabilities, it handles image understanding, document analysis, and visual reasoning tasks effectively. An excellent choice for multimodal workflows where you need reliable visual comprehension without premium pricing.
Try Now128,000 tokens
24,000 tokens
$0.3 per 1M tokens
$0.9 per 1M tokens
10 files
Yes
$0 per month