Maestro Reasoning

arcee

Maestro Reasoning is Arcee's flagship analysis model: a 32 B-parameter derivative of Qwen 2.5-32 B tuned with DPO and chain-of-thought RL for step-by-step logic. Compared to the earlier 7 B preview, the production 32 B release widens the context window to 128 k tokens and doubles pass-rate on MATH and GSM-8K, while also lifting code completion accuracy. Its instruction style encourages structured "thought → answer" traces that can be parsed or hidden according to user preference. That transparency pairs well with audit-focused industries like finance or healthcare where seeing the reasoning path matters. In Arcee Conductor, Maestro is automatically selected for complex, multi-constraint queries that smaller SLMs bounce.

Try Now

Example Use Cases

Step-by-step reasoning task

Math or logic problem

Audit-friendly reasoning with visible traces

Technical Specifications

Context Window

131,072 tokens

Max Output

32,000 tokens

Cache Miss Cost

$0.90 per 1M tokens

Non-Reasoning Cost

$3.30 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus