The Qwen3 Flash model (snapshot 2025-07-28) offers a powerful fusion of thinking and non-thinking modes with dynamic in-conversation switching, excelling in complex reasoning while showing significant gains in instruction following and text comprehension. It supports a 1M context length and is billed on a tiered model corresponding to context usage.
Try NowNeed fast cheap alibaba response
Very long context needed with alibaba
Simple to moderate alibaba task
1,000,000 tokens
32,768 tokens
$0.05 per 1M tokens
$0.40 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls