Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding, video analysis, object detection, and agentic tool-use.
Try NowEfficient multimodal image understanding
Lightweight vision-language task
Budget agentic tool use
16,384 tokens
16,384 tokens
$0.10
$0.10
$15
$0.19