The Qwen3 Turbo is a new hybrid reasoning model enabling dynamic switching between reasoning and non-reasoning modes mid-dialogue. With fewer parameters, it rivals the QwQ-32B in reasoning performance while surpassing the Qwen2.5-Turbo in general capabilities, achieving state-of-the-art (SOTA) performance at its scale. This model is a snapshot version as of April 28, 2025
Try NowUltra-fast alibaba response needed
Low-cost alibaba task
Simple classification or instruction following
131,072 tokens
8,192 tokens
$0.05 per 1M tokens
$0.20 per 1M tokens
$0.50 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls