Maestro Reasoning

Arcee

Maestro Reasoning is Arcee's flagship analysis model: a 32 B-parameter derivative of Qwen 2.5-32 B tuned with DPO and chain-of-thought RL for step-by-step logic. Compared to the earlier 7 B preview, the production 32 B release widens the context window to 128 k tokens and doubles pass-rate on MATH and GSM-8K, while also lifting code completion accuracy. Its instruction style encourages structured "thought → answer" traces that can be parsed or hidden according to user preference. That transparency pairs well with audit-focused industries like finance or healthcare where seeing the reasoning path matters. In Arcee Conductor, Maestro is automatically selected for complex, multi-constraint queries that smaller SLMs bounce.

Try Now

Technical Specifications

Context Window

131,072 tokens

Max Output

32,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.90

Non-Reasoning Output

$3.30

Legacy

Made legacy on

Reason

Arcee reasoning model; limited testing

Recommended Replacement

Qwen3.6 Plus