Relace Apply 3

Relace

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at 10,000 tokens/sec on average. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update> Zero Data Retention is enabled for Relace.

Try Now

Technical Specifications

Context Window

256,000 tokens

Max Output

128,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.85

Non-Reasoning Output

$1.25

Retired

Made legacy on

Reason

Not a chat model

Recommended Replacement

Qwen3.6 Plus

Retired on