A small, fast model that excels at RAG, tool use, agents, and similar tasks requiring complex reasoning and multiple steps. With 128K context window and extremely low pricing, it is ideal for high-throughput production workloads where cost and speed are paramount.
Try NowCheapest cohere model for rag
High-throughput cohere tool use
Budget agent sub-tasks with cohere
128,000 tokens
4,000 tokens
$0.0375 per 1M tokens
$0.15 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls