Qwen3 hybrid reasoning model enables seamless switching between thinking and non-thinking modes during conversations. It delivers strong reasoning performance with fewer parameters, comparable to QwQ, and significantly outperforms Qwen2.5-72B-Instruct in general capabilities, achieving state-of-the-art (SOTA) results for its scale.
Try NowStrong reasoning with efficient compute
Complex task needing large alibaba model
Moe model for cost-effective reasoning
131,072 tokens
8,192 tokens
$0.70 per 1M tokens
$2.80 per 1M tokens
$8.40 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls