The highspeed variant of MiniMax M2.7, delivering the same agentic and coding performance at significantly faster inference speeds (~100 tokens per second vs ~60 tps standard). Unlike previous generations, M2.7 Highspeed matches the standard variant on pricing while delivering dramatically lower latency. Shares the same 200K shared context window and 131K max output.
Try NowLow-latency agentic workflows
Real-time coding assistance
Latency-sensitive production deployments
204,800 tokens
131,072 tokens
$0.30
$1.20
$0.06
$0.375
$15
$0.19