MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding.
Try Now1,048,576 tokens
131,072 tokens
$0.40
$2
$0.08
$15
$0.19