This model is a snapshot version of Qwen-VL-Plus as of August 15, 2025. It approaches the general capabilities of Qwen2.5-VL-32B, with improved performance in object and person recognition, enhanced accuracy in real-world scenarios, and reduced hallucinations.
Budget image understanding
Object or person recognition
Cost-effective visual tasks
131,072 tokens
8,192 tokens
$0.21 per 1M tokens
$0.63 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls
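The per-token rates above can be turned into a rough cost estimate. The sketch below assumes that $0.21 per 1M tokens is the input rate and $0.63 per 1M tokens is the output rate; the card does not label them here, so treat that mapping as an assumption. The function name `estimate_token_cost` is hypothetical, and the two per-call rates are left out because the card does not say what they apply to.

```python
# Rates copied from the card. Which rate is input and which is output
# is an ASSUMPTION; the card's labels are not shown in this section.
ASSUMED_INPUT_USD_PER_1M = 0.21
ASSUMED_OUTPUT_USD_PER_1M = 0.63


def estimate_token_cost(input_tokens: int, output_tokens: int) -> float:
    """Return an estimated token cost in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * ASSUMED_INPUT_USD_PER_1M
        + output_tokens * ASSUMED_OUTPUT_USD_PER_1M
    )
    # Rates are quoted per one million tokens.
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 100,000 input tokens and 2,000 output tokens.
    print(f"${estimate_token_cost(100_000, 2_000):.5f}")
```

Under these assumptions, output tokens cost three times as much as input tokens, so long generated responses dominate the bill more quickly than large prompts do.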