Trinity Large

Arcee

Trinity Large (Preview) is a 400B-parameter (13B active) sparse mixture-of-experts language model, engineered to scale model capacity while maintaining inference efficiency over long contexts. It delivers strong performance in reasoning-heavy workloads including math, coding-related tasks, and multi-step agent workflows. With a 131K context window and native function calling, it excels at complex tasks requiring deep understanding and structured outputs.

Try Now

Capabilities

Tool Use

Example Use Cases

Complex multi-step reasoning tasks

Coding and math-heavy workloads

Agent workflows requiring strong task decomposition

Technical Specifications

Context Window

131,000 tokens

Max Output

131,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.25

Non-Reasoning Output

$1

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max