Mercury Coder

inception

Mercury Coder is a diffusion-based code-specialized language model from Inception Labs, optimized for code generation, editing, and completion with ultra-fast inference. It supports a 128K chat context window, native tool calling, and structured outputs. Ideal for coding agents and development workflows where speed and accuracy are critical.

Try Now

Capabilities

Tool Use

Example Use Cases

Code generation and editing tasks

Latency-sensitive coding workflows

Tool calling in coding agent pipelines

Technical Specifications

Context Window

128,000 tokens

Max Output

16,384 tokens

Cache Miss Cost

$0.25 per 1M tokens

Non-Reasoning Cost

$1 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus