The Qwen3 hybrid reasoning model supports seamless switching between thinking and non-thinking modes during conversations. It outperforms the Qwen2.5 small-scale series in general capabilities.
Try NowSmallest possible alibaba model
Edge device deployment
Ultra-low-resource environment
32,768 tokens
8,192 tokens
$0.11 per 1M tokens
$0.42 per 1M tokens
$1.26 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls