DeepSeek V4 Pro

DeepSeek

The flagship DeepSeek V4 model: a 1.6T-parameter Mixture-of-Experts model (49B activated) with a 1M-token context window. It matches or exceeds leading closed-source models on coding and agentic benchmarks (93.5% LiveCodeBench, 80.6% SWE-Verified, 3206 Codeforces rating at max reasoning effort). It switches between fast non-thinking responses and explicit chain-of-thought reasoning with configurable effort, up to "max" for the hardest problems in mathematics, competitive programming, and scientific analysis. Tool calls are supported in both modes. Choose this model when you need frontier-level quality on complex tasks.

Capabilities

Thinking

Tool Use

Technical Specifications

Context Window

1,000,000 tokens

Max Output

262,144 tokens
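
The two limits above interact when a prompt is very long. The sketch below shows one way to work out how many output tokens a request could still ask for. It assumes that prompt and output tokens share the 1,000,000-token context window, which this page does not state, and the function name `max_output_budget` is hypothetical.

```python
# Limits copied from the Technical Specifications above.
CONTEXT_WINDOW = 1_000_000  # tokens
MAX_OUTPUT = 262_144        # tokens


def max_output_budget(prompt_tokens: int) -> int:
    """Return the largest output length that fits alongside the prompt.

    ASSUMPTION: prompt and output tokens count against the same
    context window. The page does not confirm this.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the per-response cap, never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (100_000, 900_000, 1_000_000):
        print(prompt, max_output_budget(prompt))
```

Under that assumption, the output cap only starts to shrink once the prompt exceeds 737,856 tokens (1,000,000 minus 262,144).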

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$1.74

Non-Reasoning Output

$3.48

Cache Read Input

$0.145

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19
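
As a rough illustration of how the rates above combine, the following sketch estimates the cost of a workload from token and tool-call counts. The rates are copied from the tables above; the function name `estimate_cost` and the dictionary keys are hypothetical, and the sketch assumes simple linear billing with no minimums or rounding. The page lists no rate for reasoning output, so thinking-mode output tokens are not covered.

```python
from decimal import Decimal

# USD per 1M tokens, copied from "Token Costs" above.
PER_MILLION_TOKENS = {
    "cache_miss_input": Decimal("1.74"),
    "cache_read_input": Decimal("0.145"),
    "non_reasoning_output": Decimal("3.48"),
}

# USD per 1K calls, copied from "Tool Costs" above.
PER_THOUSAND_CALLS = {
    "web_search": Decimal("15"),
    "code_execution": Decimal("0.19"),
}


def estimate_cost(tokens=None, tool_calls=None) -> Decimal:
    """Estimate the USD cost of a workload.

    ASSUMPTION: billing is strictly proportional to usage.
    Unknown keys raise KeyError rather than being silently ignored.
    """
    total = Decimal("0")
    for kind, count in (tokens or {}).items():
        total += PER_MILLION_TOKENS[kind] * count / Decimal(1_000_000)
    for kind, count in (tool_calls or {}).items():
        total += PER_THOUSAND_CALLS[kind] * count / Decimal(1_000)
    return total


if __name__ == "__main__":
    cost = estimate_cost(
        tokens={
            "cache_miss_input": 200_000,
            "cache_read_input": 800_000,
            "non_reasoning_output": 50_000,
        },
        tool_calls={"web_search": 10, "code_execution": 100},
    )
    print(f"${cost}")
```

In this example most of the input is served from cache: 800,000 cached tokens cost $0.116, whereas the same tokens at the cache-miss rate would cost $1.392. `Decimal` is used so that the small per-token amounts add up without floating-point drift.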