Qwen 3 Max Thinking

Alibaba

The most capable Qwen reasoning model, integrating thinking and non-thinking modes for comprehensive problem-solving. In thinking mode, it combines deliberate reasoning with web search, web extraction, and code interpreter tools to tackle complex challenges with greater accuracy. With a 256K context window and 65K output tokens, this model excels at problems requiring both deep thought and external tool use.

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Complex multi-step reasoning with tool access

Need deliberate slow thinking for difficult problems

Combining reasoning with web search or code execution

Technical Specifications

Context Window

262,144 tokens

Max Output

65,536 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$1.20

Non-Reasoning Output

$6

Cache Read Input

$0.24

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Retired

Made legacy on

Reason

Merged

Recommended Replacement

Qwen3 Max

Retired on