Kimi's latest and most intelligent model. Next-generation native multimodal architecture with breakthroughs in long-horizon coding, instruction compliance, and self-correction. Handles text, image, and video input across thinking and non-thinking modes, with a 262K context window and 252K output capacity for complex multimodal and agent workflows.
Try Now262,144 tokens
252,144 tokens
$0.95
$4
$0.16
$15
$0.19