Qwen2.5 14B Instruct 1M

Alibaba

The 14B model in the Qwen2.5 series has significantly more knowledge than Qwen2 and is much stronger at programming and mathematics. It also improves on instruction following, long-text generation, understanding structured data (such as tables), and generating structured output, particularly JSON. It supports a context window of up to 1M tokens.

Capabilities

Tool Use

Example Use Cases

Extremely long context with an open model

Processing large documents or codebases

Long-context retrieval tasks

Technical Specifications

Context Window

1,000,000 tokens

Max Output

8,192 tokens

Cache Miss Cost

$0.805 per 1M tokens

Non-Reasoning Cost

$3.22 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls
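
The figures above can be combined into a rough per-request budget and cost estimate. The sketch below is illustrative only and rests on assumptions not stated on this page: that the Cache Miss Cost applies to input tokens, that the Non-Reasoning Cost applies to output tokens, and that input and output tokens share the 1,000,000-token context window. The function names `fits_context` and `estimate_cost` are hypothetical helpers, not part of any API.

```python
# Values copied from the Technical Specifications listed above.
CONTEXT_WINDOW = 1_000_000   # tokens
MAX_OUTPUT = 8_192           # tokens

# Assumption: cache-miss cost is billed on input tokens,
# non-reasoning cost is billed on output tokens.
INPUT_COST_PER_M = 0.805     # USD per 1M tokens
OUTPUT_COST_PER_M = 3.22     # USD per 1M tokens
WEB_SEARCH_PER_K = 15.0      # USD per 1K calls
CODE_EXEC_PER_K = 0.19       # USD per 1K calls


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the output cap and the context window.

    Assumption: input and output tokens count against the same window.
    """
    if output_tokens > MAX_OUTPUT:
        return False
    return input_tokens + output_tokens <= CONTEXT_WINDOW


def estimate_cost(input_tokens: int, output_tokens: int,
                  web_searches: int = 0, code_executions: int = 0) -> float:
    """Estimate the cost of one request in USD under the assumptions above."""
    return (input_tokens / 1_000_000 * INPUT_COST_PER_M
            + output_tokens / 1_000_000 * OUTPUT_COST_PER_M
            + web_searches / 1_000 * WEB_SEARCH_PER_K
            + code_executions / 1_000 * CODE_EXEC_PER_K)


if __name__ == "__main__":
    # A 500K-token document with a maximum-length response.
    print(fits_context(500_000, 8_192))              # True
    print(f"{estimate_cost(500_000, 8_192):.4f}")    # 0.4289
```

Under these assumptions, input tokens dominate the cost of long-context requests, because output is capped at 8,192 tokens per response.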

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3 Max