Qwen3.6 Plus

Alibaba

The Qwen3.6 native vision-language Plus series models demonstrate exceptional performance on par with the current state-of-the-art models, with a significant improvement in overall results compared to the 3.5 series. The models have been markedly enhanced in code-related capabilities such as agentic coding, front-end programming, and Vibe coding, as well as in multi-modal general object recognition, OCR, and object localization.

Try Now

Capabilities

Thinking

Tool Use

Image Input

Example Use Cases

Multimodal vision-language task with alibaba

Agentic coding and front-end programming

Long-context multimodal understanding

Technical Specifications

Context Window

1,000,000 tokens

Max Output

65,536 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input
≤ 256,000 input$0.276
> 256,000 input$1.101
Non-Reasoning Output
≤ 256,000 input$1.651
> 256,000 input$6.602
Cache Read Input
≤ 256,000 input$0.028
> 256,000 input$0.111
Cache Write Input
≤ 256,000 input$0.344
> 256,000 input$1.376

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19