An instruction-following conversational model that performs language tasks at a higher quality, more reliably, and with a longer context than previous models. Best suited for complex RAG workflows and multi-step tool use. 128K context window with 4K max output.
Try NowComplex rag with cohere
Multi-step tool use workflow
Enterprise cohere deployment
128,000 tokens
4,000 tokens
$2.50 per 1M tokens
$10 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls