Qwen3.5 Plus

Alibaba

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency. In a variety of task evaluations, the 3.5 series consistently demonstrates performance on par with state-of-the-art leading models. Compared to the 3 series, these models show a leap forward in both pure-text and multimodal capabilities.

Try Now

Capabilities

Thinking

Tool Use

Image Input

Technical Specifications

Context Window

1,000,000 tokens

Max Output

65,536 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input
≤ 256,000 input$0.40
> 256,000 input$1.20
Non-Reasoning Output
≤ 256,000 input$2.40
> 256,000 input$7.20
Cache Read Input
≤ 256,000 input$0.04
> 256,000 input$0.12
Cache Write Input
≤ 256,000 input$0.50
> 256,000 input$1.50

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Outdated model

Recommended Replacement

Qwen3.6 Plus