Qwen Turbo

alibaba

The Qwen3 Turbo is a new hybrid reasoning model enabling dynamic switching between reasoning and non-reasoning modes mid-dialogue. With fewer parameters, it rivals the QwQ-32B in reasoning performance while surpassing the Qwen2.5-Turbo in general capabilities, achieving state-of-the-art (SOTA) performance at its scale. This model is a snapshot version as of April 28, 2025

Try Now

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Ultra-fast alibaba response needed

Low-cost alibaba task

Simple classification or instruction following

Technical Specifications

Context Window

131,072 tokens

Max Output

8,192 tokens

Cache Miss Cost

$0.05 per 1M tokens

Non-Reasoning Cost

$0.20 per 1M tokens

Reasoning Cost

$0.50 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max