Qwen3 VL 30B A3B Thinking

Alibaba

The "Thinking" edition of Qwen3-VL's second-largest MoE model offers fast responses, stronger multimodal understanding and reasoning, visual agent capabilities, and ultra-long context support for inputs such as long videos and documents. It improves image and video comprehension, spatial perception, and object recognition for complex real-world tasks.

Capabilities

Thinking

Tool Use

Image Input

Technical Specifications

Context Window

131,072 tokens

Max Output

32,768 tokens
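
The two limits above can be combined into a rough prompt budget. The sketch below assumes that prompt and output tokens share the 131,072-token context window, which this page does not state; the function name `max_prompt_tokens` is hypothetical and only illustrates the arithmetic.

```python
# Limits copied from the Technical Specifications above.
CONTEXT_WINDOW = 131_072  # tokens
MAX_OUTPUT = 32_768       # tokens


def max_prompt_tokens(requested_output: int) -> int:
    """Return the prompt budget left after reserving room for the output.

    Assumption (not confirmed by this page): prompt and output tokens
    are counted against the same context window.
    """
    if not 0 < requested_output <= MAX_OUTPUT:
        raise ValueError(f"requested_output must be between 1 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - requested_output


# Reserving the full output allowance leaves 98,304 tokens for the prompt.
print(max_prompt_tokens(32_768))  # prints 98304
```

If the provider counts output separately from the context window, the full 131,072 tokens would be available to the prompt and this check is not needed.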

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.20

Non-Reasoning Output

$2.40

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19
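
As a worked example of the rates above, the following sketch estimates the cost of a single request. It assumes every input token is billed at the cache-miss rate and every output token at the non-reasoning rate, since no other token rates are listed here; the function name `estimate_cost` is hypothetical.

```python
# Rates copied from the Pricing section above (USD).
INPUT_PER_M = 0.20        # cache-miss input, per 1M tokens
OUTPUT_PER_M = 2.40       # non-reasoning output, per 1M tokens
WEB_SEARCH_PER_K = 15.00  # web search, per 1K calls
CODE_EXEC_PER_K = 0.19    # code execution, per 1K calls


def estimate_cost(input_tokens: int, output_tokens: int,
                  web_searches: int = 0, code_runs: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumptions: no cached input, and all output billed at the
    non-reasoning rate (other rates are not listed on this page).
    """
    token_cost = (input_tokens * INPUT_PER_M
                  + output_tokens * OUTPUT_PER_M) / 1_000_000
    tool_cost = (web_searches * WEB_SEARCH_PER_K
                 + code_runs * CODE_EXEC_PER_K) / 1_000
    return token_cost + tool_cost


# 50,000 input tokens, 4,000 output tokens, and two web searches.
print(f"${estimate_cost(50_000, 4_000, web_searches=2):.4f}")  # prints $0.0496
```

In this example the two web searches ($0.03) cost more than all of the tokens combined ($0.0196), so tool calls can dominate the bill for short requests.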

Legacy

Made legacy on

Reason

30B VL Thinking model; superseded by Qwen3 VL 235B Thinking

Recommended Replacement

Qwen3.6 Plus