Kimi K2 Turbo

Moonshot AI

High-speed version of Kimi K2 that always tracks the latest kimi-k2 release. It uses the same model parameters, with output speed typically around 60 tokens/sec and a maximum of 100 tokens/sec. The context length is 262K tokens, with support for automatic context caching, ToolCalls, JSON Mode, Partial Mode, and internet search.

Capabilities

Tool Use

Example Use Cases

Agentic workflows

Code synthesis

Reasoning tasks

Technical Specifications

Context Window: 262,144 tokens

Max Output: 16,384 tokens
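
As a rough illustration of how these two limits interact, the sketch below computes how many prompt tokens remain once an output budget is reserved. It assumes that prompt and output tokens share the same context window, which this page does not state. The function name `max_prompt_tokens` is hypothetical and not part of any Moonshot AI API.

```python
CONTEXT_WINDOW = 262_144  # tokens, from the specification above
MAX_OUTPUT = 16_384       # tokens, from the specification above


def max_prompt_tokens(requested_output: int = MAX_OUTPUT) -> int:
    """Return the largest prompt that still leaves room for the requested output.

    Assumption: prompt and output tokens are counted against the same window.
    """
    if not 0 < requested_output <= MAX_OUTPUT:
        raise ValueError(f"requested_output must be between 1 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - requested_output
```

Under that assumption, reserving the full 16,384-token output budget leaves 245,760 tokens for the prompt.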

Pricing

Token Costs (per 1M tokens)

Cache Miss Input: $1.15

Non-Reasoning Output: $8

Cache Read Input: $0.15

Tool Costs (per 1K calls)

Web Search: $15

Code Execution: $0.19
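
To make the listed rates concrete, here is a minimal cost-estimation sketch. The prices are copied from this page. The function `estimate_cost` and its parameter names are hypothetical helpers for illustration, and the sketch assumes billing is strictly linear in tokens and calls, with no minimums or rounding rules.

```python
from decimal import Decimal

# Prices in USD, copied from the listing above.
PRICE_PER_1M_TOKENS = {
    "cache_miss_input": Decimal("1.15"),
    "cache_read_input": Decimal("0.15"),
    "output": Decimal("8"),
}
PRICE_PER_1K_CALLS = {
    "web_search": Decimal("15"),
    "code_execution": Decimal("0.19"),
}


def estimate_cost(
    cache_miss_input: int = 0,
    cache_read_input: int = 0,
    output: int = 0,
    web_search: int = 0,
    code_execution: int = 0,
) -> Decimal:
    """Estimate the USD cost of a workload (assumes purely linear billing)."""
    tokens = {
        "cache_miss_input": cache_miss_input,
        "cache_read_input": cache_read_input,
        "output": output,
    }
    calls = {"web_search": web_search, "code_execution": code_execution}

    # Token prices are quoted per million tokens.
    token_cost = sum(
        (PRICE_PER_1M_TOKENS[kind] * count / 1_000_000 for kind, count in tokens.items()),
        Decimal(0),
    )
    # Tool prices are quoted per thousand calls.
    tool_cost = sum(
        (PRICE_PER_1K_CALLS[kind] * count / 1_000 for kind, count in calls.items()),
        Decimal(0),
    )
    return token_cost + tool_cost


# Example workload: mostly cached input, a modest output, and 20 web searches.
total = estimate_cost(
    cache_miss_input=200_000,
    cache_read_input=800_000,
    output=50_000,
    web_search=20,
)
print(f"Estimated cost: ${total:.2f}")
```

For this example workload the components are $0.23 (cache-miss input), $0.12 (cache-read input), $0.40 (output), and $0.30 (web search), for a total of $1.05. `Decimal` is used instead of floats so that small per-token amounts add up without binary rounding noise.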

Legacy

Made legacy on

Reason: Poor tool-calling capabilities; the model hallucinates web searches.

Recommended Replacement: Kimi K2.5