The Qwen2.5VL series model has reached a level close to Qwen2.5VL-72B in answering math and subject questions, with the reply style significantly adjusted towards human preferences, especially for objective queries such as mathematics, logical reasoning, knowledge Q&A, etc., where the model's detailed responses and clear formatting have been notably improved. This version is the 32B version.
Try NowStrong open vision model needed
Visual math or document understanding
Self-hosted multimodal model
131,072 tokens
8,192 tokens
$1.40 per 1M tokens
$4.20 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls