Qwen Turbo

Alibaba

The Qwen3 Turbo is a new hybrid reasoning model enabling dynamic switching between reasoning and non-reasoning modes mid-dialogue. With fewer parameters, it rivals the QwQ-32B in reasoning performance while surpassing the Qwen2.5-Turbo in general capabilities, achieving state-of-the-art (SOTA) performance at its scale. This model is a snapshot version as of April 28, 2025

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Ultra-fast alibaba response needed

Low-cost alibaba task

Simple classification or instruction following

Technical Specifications

Context Window

131,072 tokens

Max Output

8,192 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.05

Non-Reasoning Output

$0.20

Reasoning Output

$0.50

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max