DeepSeek R1T2 Chimera

tngtech

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671B-parameter mixture-of-experts text-generation model assembled from three DeepSeek-AI checkpoints (R1-0528, R1, and V3-0324) using an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20% faster than the original R1 and more than 2× faster than R1-0528 under vLLM, a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60k tokens in standard use (tested to ~130k) and maintains consistent <think> token behaviour, making it suitable for long-context analysis, dialogue, and other open-ended generation tasks.
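Because the model emits its reasoning inside <think> tags, client code usually needs to separate the reasoning from the final answer. The following is a minimal sketch, not an official client: the function name split_reasoning is hypothetical, and it assumes the raw completion contains at most one closing </think> tag, with the opening <think> tag possibly absent if the serving stack prefills it.

```python
def split_reasoning(raw: str) -> tuple[str, str]:
    """Split raw model output into (reasoning, answer).

    Assumption: reasoning ends at the first "</think>" tag. The opening
    "<think>" tag is optional, since some chat templates prefill it.
    """
    head, sep, tail = raw.partition("</think>")
    if not sep:
        # No closing tag found: treat the whole output as the answer.
        return "", raw.strip()
    # Drop the opening tag if the model emitted it.
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


if __name__ == "__main__":
    reasoning, answer = split_reasoning(
        "<think>2 + 2 is 4</think>The answer is 4."
    )
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Splitting on the closing tag rather than matching both tags keeps the helper tolerant of outputs where only </think> appears.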

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Cost-efficient DeepSeek reasoning

Fast long-context analysis

Tri-merge DeepSeek reasoning with tools

Technical Specifications

Context Window: 163,840 tokens

Max Output: 163,840 tokens

Cache Miss Cost: $0.25 per 1M tokens

Non-Reasoning Cost: $0.85 per 1M tokens

Cache Read Cost: $0.125 per 1M tokens

Web Search Cost: $15 per 1K calls

Code Execution Cost: $0.19 per 1K calls
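To make the prices above concrete, here is a rough cost estimator. It is a sketch under stated assumptions, not a billing formula from the provider: it assumes "Cache Miss Cost" applies to uncached input tokens, "Cache Read Cost" to cached input tokens, and "Non-Reasoning Cost" to output tokens. The function name estimate_cost and its parameters are hypothetical.

```python
# Prices copied from the specifications above (USD).
CACHE_MISS_PER_M = 0.25      # per 1M tokens; assumed: uncached input
CACHE_READ_PER_M = 0.125     # per 1M tokens; assumed: cached input
NON_REASONING_PER_M = 0.85   # per 1M tokens; assumed: output
WEB_SEARCH_PER_K = 15.0      # per 1K calls
CODE_EXEC_PER_K = 0.19       # per 1K calls


def estimate_cost(
    uncached_input_tokens: int,
    cached_input_tokens: int,
    output_tokens: int,
    web_search_calls: int = 0,
    code_exec_calls: int = 0,
) -> float:
    """Estimate the USD cost of one request under the assumptions above."""
    token_cost = (
        uncached_input_tokens * CACHE_MISS_PER_M
        + cached_input_tokens * CACHE_READ_PER_M
        + output_tokens * NON_REASONING_PER_M
    ) / 1_000_000
    tool_cost = (
        web_search_calls * WEB_SEARCH_PER_K
        + code_exec_calls * CODE_EXEC_PER_K
    ) / 1_000
    return token_cost + tool_cost


if __name__ == "__main__":
    # Example: a 120k-token prompt (20k of it cached) with a 5k-token reply.
    cost = estimate_cost(100_000, 20_000, 5_000)
    print(f"estimated cost: ${cost:.5f}")
```

If the provider maps these price labels to token types differently, only the three per-token constants need to change.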

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus