A 7B-parameter transformer model that is quick to deploy and easy to customise.
Simple text generation
Basic question answering
Fast, lightweight inference
32,768 tokens
$0.25 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls
Untested