Qwen3.5 9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design with early fusion of multimodal tokens, allowing the model to process and reason across text and images within the same context.
Try NowBudget multimodal reasoning and coding
Lightweight vision-language understanding
Efficient small model with thinking
256,000 tokens
65,536 tokens
$0.05
$0.15
$15
$0.19