Maestro Reasoning is Arcee's flagship analysis model: a 32 B-parameter derivative of Qwen 2.5-32 B tuned with DPO and chain-of-thought RL for step-by-step logic. Compared to the earlier 7 B preview, the production 32 B release widens the context window to 128 k tokens and doubles pass-rate on MATH and GSM-8K, while also lifting code completion accuracy. Its instruction style encourages structured "thought → answer" traces that can be parsed or hidden according to user preference. That transparency pairs well with audit-focused industries like finance or healthcare where seeing the reasoning path matters. In Arcee Conductor, Maestro is automatically selected for complex, multi-constraint queries that smaller SLMs bounce.
Try NowStep-by-step reasoning task
Math or logic problem
Audit-friendly reasoning with visible traces
131,072 tokens
32,000 tokens
$0.90 per 1M tokens
$3.30 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls