Aya Expanse 8B

cohere

A compact 8B multilingual model designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages with fast response times and low latency. Ideal for high-throughput multilingual workloads where cost and speed matter.

Try Now

Example Use Cases

Fast multilingual generation

Budget-friendly cross-lingual task

High-throughput multilingual processing

Technical Specifications

Context Window

8,000 tokens

Max Output

4,000 tokens

Cache Miss Cost

$0.50 per 1M tokens

Non-Reasoning Cost

$1.50 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Claude Haiku 4.5