Trinity Large Thinking

Arcee

Trinity-Large-Thinking is the reasoning-optimized variant in Arcee AI's Trinity-Large family. It is a 398B-parameter sparse Mixture-of-Experts model with approximately 13B active parameters per token. Built on Trinity-Large-Base and post-trained with extended chain-of-thought reasoning and agentic reinforcement learning (RL), it delivers state-of-the-art performance on agentic benchmarks. The model generates explicit reasoning traces wrapped in think blocks before producing its final response. It is purpose-built for tool calling, multi-step planning, and agent workflows.
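Because the reasoning trace arrives in the same output as the final response, client code usually needs to separate the two. The sketch below shows one way to do that. It assumes the think blocks are delimited by literal <think> and </think> tags, which this page does not confirm, and split_reasoning is a hypothetical helper name rather than part of any Arcee SDK.

```python
import re

# Assumption: reasoning is wrapped in literal <think>...</think> tags.
# The page only says "think blocks", so adjust the pattern if the
# delimiters differ in practice.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Split raw model output into (reasoning, final_answer).

    Hypothetical helper for illustration only.
    """
    # Collect the contents of every think block, in order.
    reasoning = "\n".join(part.strip() for part in THINK_RE.findall(text))
    # Whatever remains after removing the blocks is the final response.
    answer = THINK_RE.sub("", text).strip()
    return reasoning, answer


if __name__ == "__main__":
    raw = "<think>2 + 2 = 4</think>The answer is 4."
    trace, final = split_reasoning(raw)
    print("reasoning:", trace)
    print("answer:", final)
```

Output with no think block passes through unchanged as the answer, with an empty reasoning string.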

Capabilities

Thinking

Tool Use

Example Use Cases

Agentic workflows with multi-step planning

Complex reasoning requiring explicit chain-of-thought

Tool calling with reasoning traces

Technical Specifications

Context Window: 131,072 tokens

Max Output: 32,768 tokens
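These two limits can be used to pick a safe output budget before sending a request. The following is a minimal sketch, assuming the prompt and the generated output share the 131,072-token context window (the page does not state this explicitly). clamp_max_tokens is a hypothetical helper name.

```python
# Limits copied from the specifications above.
CONTEXT_WINDOW = 131_072
MAX_OUTPUT = 32_768


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return the largest output budget that fits both limits.

    Assumption: prompt and output tokens count against the same
    context window. Hypothetical helper for illustration only.
    """
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return min(requested, MAX_OUTPUT, remaining)


if __name__ == "__main__":
    print(clamp_max_tokens(prompt_tokens=1_000, requested=50_000))
    print(clamp_max_tokens(prompt_tokens=120_000, requested=50_000))
```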

Pricing

Token Costs (per 1M tokens)

Cache Miss Input: $0.25

Non-Reasoning Output: $0.90

Tool Costs (per 1K calls)

Web Search: $15

Code Execution: $0.19
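As a worked example of how these rates combine, the sketch below estimates the cost of a request. It covers only the four rates listed here. Cached-input and reasoning-output rates are not given on this page, so they are left out rather than guessed. estimate_cost is a hypothetical helper name.

```python
from decimal import Decimal

# Rates copied from the pricing list above (USD).
INPUT_PER_M = Decimal("0.25")       # cache-miss input, per 1M tokens
OUTPUT_PER_M = Decimal("0.90")      # non-reasoning output, per 1M tokens
WEB_SEARCH_PER_K = Decimal("15")    # web search, per 1K calls
CODE_EXEC_PER_K = Decimal("0.19")   # code execution, per 1K calls

MILLION = Decimal(1_000_000)
THOUSAND = Decimal(1_000)


def estimate_cost(input_tokens: int, output_tokens: int,
                  web_searches: int = 0, code_runs: int = 0) -> Decimal:
    """Estimate USD cost from the listed rates only.

    Assumption: all input is billed at the cache-miss rate and all
    output at the non-reasoning rate. Hypothetical helper.
    """
    token_cost = (Decimal(input_tokens) * INPUT_PER_M
                  + Decimal(output_tokens) * OUTPUT_PER_M) / MILLION
    tool_cost = (Decimal(web_searches) * WEB_SEARCH_PER_K
                 + Decimal(code_runs) * CODE_EXEC_PER_K) / THOUSAND
    return token_cost + tool_cost


if __name__ == "__main__":
    # 200K input tokens, 50K output tokens, 10 web searches.
    print(estimate_cost(200_000, 50_000, web_searches=10))
```

Decimal is used instead of float so that small per-call amounts add up without rounding drift.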

Legacy

Made legacy on

Reason: Untested

Recommended Replacement: Qwen3.6 Plus