A compact 8B multilingual model designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages with fast response times and low latency. Ideal for high-throughput multilingual workloads where cost and speed matter.
Try Now8,000 tokens
4,000 tokens
$0.50
$1.50
8B multilingual model; too small for production