Qwen3.5 397B A17B

Alibaba

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers state-of-the-art performance comparable to leading-edge models across a wide range of tasks, including language understanding, logical reasoning, code generation, agent-based tasks, image understanding, video understanding, and graphical user interface (GUI) interactions. With its robust code-generation and agent capabilities, the model exhibits strong generalization across diverse agent scenarios.

Try Now

Capabilities

Thinking

Tool Use

Image Input

Technical Specifications

Context Window

256,000 tokens

Max Output

64,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.60

Non-Reasoning Output

$3.60

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19