Cohere's most performant model, excelling at tool use, agents, retrieval augmented generation (RAG), and multilingual use cases. Command A has a 256K context window, only requires two GPUs to run, and has 150% higher throughput compared to Command R+ 08-2024. Ideal for enterprise deployments requiring strong tool integration and multilingual support.
Try NowCohere-specific enterprise deployment
Multilingual rag and tool use
Private deployment requiring cohere models
256,000 tokens
8,000 tokens
$2.50 per 1M tokens
$10 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls