Trinity Large (Preview) is a 400B-parameter (13B active) sparse mixture-of-experts language model, engineered to scale model capacity while maintaining inference efficiency over long contexts. It delivers strong performance in reasoning-heavy workloads including math, coding-related tasks, and multi-step agent workflows. With a 131K context window and native function calling, it excels at complex tasks requiring deep understanding and structured outputs.
Try NowComplex multi-step reasoning tasks
Coding and math-heavy workloads
Agent workflows requiring strong task decomposition
131,000 tokens
131,000 tokens
$0.25 per 1M tokens
$1 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls