Qwen3 VL 8B Thinking

Alibaba

The "Thinking" edition of Qwen3-VL 8B Dense has a reduced memory footprint, enabling multimodal understanding and reasoning. It supports ultra-long contexts (e.g., long videos and documents), 2D/3D visual localization, and enhances image/video comprehension, spatial perception, and object recognition.

Try Now

Capabilities

Thinking

Tool Use

Image Input

Technical Specifications

Context Window

131,072 tokens

Max Output

32,768 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.18

Non-Reasoning Output

$2.10

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

8B VL; too small for production

Recommended Replacement

Qwen3.6 Plus