Qwen3 30B A3B

alibaba

Qwen3 hybrid reasoning model enables seamless switching between thinking and non-thinking modes during conversations. It delivers strong reasoning performance with fewer parameters, comparable to QwQ-32B, and significantly outperforms Qwen2.5-14B in general capabilities, achieving state-of-the-art (SOTA) results for its scale.

Try Now

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Efficient reasoning on a budget

Cost-effective complex task

Moe model for lightweight reasoning

Technical Specifications

Context Window

131,072 tokens

Max Output

8,192 tokens

Cache Miss Cost

$0.20 per 1M tokens

Non-Reasoning Cost

$0.80 per 1M tokens

Reasoning Cost

$2.40 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max