The Qwen3 series VL models has been comprehensively upgraded in areas such as visual coding and spatial perception. Its visual perception and recognition capabilities have significantly improved, supporting the understanding of ultra-long videos, and its OCR functionality has undergone a major enhancement.
Try Now131,072 tokens
32,768 tokens
$0.40
$1.60
$15
$0.19
Open VL model; superseded by Qwen3 VL proprietary