Qwen3 1.7B

alibaba

The Qwen3 hybrid reasoning model supports seamless switching between thinking and non-thinking modes during conversations. It outperforms the Qwen2.5 small-scale series in general capabilities, with stronger human preference alignment and notable gains in creative writing, role-playing, multi-turn dialogue, and instruction following—resulting in a significantly improved user experience.

Try Now

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Small model with reasoning

Lightweight local deployment with thinking

Resource-constrained creative task

Technical Specifications

Context Window

32,768 tokens

Max Output

8,192 tokens

Cache Miss Cost

$0.11 per 1M tokens

Non-Reasoning Cost

$0.42 per 1M tokens

Reasoning Cost

$1.26 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max