Trinity Mini

Arcee

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model, engineered for efficient long-context inference, function calling, and multi-step agent workflows. Its 128K-token context window (131,072 tokens) supports coherent multi-turn reasoning and reliable tool use at a low per-token price. It is best suited to production deployments where speed and cost efficiency are paramount.

Capabilities

Tool Use

Technical Specifications

Context Window

131,072 tokens

Max Output

131,072 tokens
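
The two limits above can be combined into a simple request budget. The sketch below is illustrative only: it assumes that prompt and completion tokens share the 131,072-token context window, which this page does not state, and the function name `max_completion_tokens` is hypothetical rather than part of any Arcee API.

```python
# Limits copied from the specifications above.
CONTEXT_WINDOW = 131_072  # tokens
MAX_OUTPUT = 131_072      # tokens


def max_completion_tokens(prompt_tokens: int) -> int:
    """Largest completion that still fits, given the prompt size.

    Assumption: prompt and completion share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the output cap, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    print(max_completion_tokens(100_000))
```

Under that assumption, a 100,000-token prompt leaves room for at most 31,072 output tokens.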

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.045

Non-Reasoning Output

$0.15

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19
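
The prices above can be turned into a rough cost estimate. This is a minimal sketch, not an official billing formula: it assumes every input token is billed at the cache-miss rate (the only input price listed here) and that tool calls are billed linearly per call. The function name `estimate_cost` is hypothetical.

```python
# Prices copied from the tables above, in USD.
INPUT_PER_1M = 0.045      # cache-miss input, per 1M tokens
OUTPUT_PER_1M = 0.15      # non-reasoning output, per 1M tokens
WEB_SEARCH_PER_1K = 15.0  # web search, per 1K calls
CODE_EXEC_PER_1K = 0.19   # code execution, per 1K calls


def estimate_cost(input_tokens: int, output_tokens: int,
                  web_searches: int = 0, code_executions: int = 0) -> float:
    """Estimate the cost in USD of a workload.

    Assumption: all input tokens are cache misses.
    """
    token_cost = (input_tokens / 1_000_000 * INPUT_PER_1M
                  + output_tokens / 1_000_000 * OUTPUT_PER_1M)
    tool_cost = (web_searches / 1_000 * WEB_SEARCH_PER_1K
                 + code_executions / 1_000 * CODE_EXEC_PER_1K)
    return token_cost + tool_cost


if __name__ == "__main__":
    # 2M input tokens, 500K output tokens, 100 searches, 1,000 code runs.
    print(f"${estimate_cost(2_000_000, 500_000, 100, 1_000):.3f}")
```

For that workload the token charges come to $0.09 + $0.075 and the tool charges to $1.50 + $0.19, so web search dominates the total even at a modest call count.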

Legacy

Made legacy on

Reason

Small model; superseded by Trinity Large

Recommended Replacement

Qwen3.6 Plus