The 14B model of the Qwen2.5 series has gained significantly more knowledge compared to Qwen2, and has greatly improved in programming and mathematical abilities. Additionally, the new model has made improvements in executing instructions, generating long texts, understanding structured data (such as tables), and generating structured outputs, particularly JSON. It supports a context of 1M tokens.
Try NowExtremely long context with open model
Processing large documents or codebases
Long-context retrieval task
1,000,000 tokens
8,192 tokens
$0.805 per 1M tokens
$3.22 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls