The 7B model of the Qwen2.5 series has gained significantly more knowledge compared to Qwen2, and has greatly improved in programming and mathematical abilities. Additionally, the new model has made improvements in executing instructions, generating long texts, understanding structured data (such as tables), and generating structured outputs, particularly JSON. It supports a context of 1M tokens.
Try NowLong context with small model
Budget long-document processing
Lightweight model for large input
1,000,000 tokens
8,192 tokens
$0.368 per 1M tokens
$1.47 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls