Qwen3 VL 8B Instruct

Alibaba

Qwen3-VL 8B Dense model has a reduced memory footprint and delivers comprehensive improvements in image/video understanding, ultra-long context support (e.g., long videos and documents), spatial perception, and object recognition, enabling it to handle complex real-world tasks.

Try Now

Capabilities

Tool Use

Image Input

Technical Specifications

Context Window

131,072 tokens

Max Output

32,768 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.18

Non-Reasoning Output

$0.70

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

8B VL; too small for production

Recommended Replacement

Qwen3 Max