Qwen3 Max

Alibaba

Compared with the snapshot as of September 23, 2025, the Qwen-3 series Max model in this release achieves an effective integration of thinking and non-thinking modes, resulting in a comprehensive and substantial improvement in the model's overall performance. In thinking mode, the model simultaneously supports web search, web information extraction, and a code interpreter tool, enabling it to tackle more complex and challenging problems with greater accuracy by leveraging external tools while engaging in slow, deliberative reasoning. This version is based on a snapshot taken on January 23, 2026.

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Best alibaba model needed

Complex reasoning or coding with alibaba

Agentic task requiring tool use

Technical Specifications

Context Window

262,144 tokens

Max Output

32,768 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input
≤ 32,000 input$1.20
≤ 128,000 input$2.40
> 128,000 input$3
Non-Reasoning Output
≤ 32,000 input$6
≤ 128,000 input$12
> 128,000 input$15

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19