Qwen3 0.6B

Alibaba

The Qwen3 hybrid reasoning model supports seamless switching between thinking and non-thinking modes during conversations. It outperforms the Qwen2.5 small-scale series in general capabilities.

Try Now

Capabilities

Thinking

Tool Use

Example Use Cases

Smallest possible alibaba model

Edge device deployment

Ultra-low-resource environment

Technical Specifications

Context Window

32,768 tokens

Max Output

8,192 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.11

Non-Reasoning Output

$0.42

Reasoning Output

$1.26

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max