Aya Expanse 8B

Cohere

A compact 8B multilingual model designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages with fast response times and low latency. Ideal for high-throughput multilingual workloads where cost and speed matter.

Try Now

Technical Specifications

Context Window

8,000 tokens

Max Output

4,000 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.50

Non-Reasoning Output

$1.50

Legacy

Made legacy on

Reason

8B multilingual model; too small for production

Recommended Replacement

Command A Reasoning