Llemma 7b

eleutherai

Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at chain-of-thought mathematical reasoning and using computational tools for mathematics, such as Python and formal theorem provers.

Try Now

Example Use Cases

Mathematical reasoning

Formal theorem proving

Chain-of-thought math problem

Technical Specifications

Context Window

4,096 tokens

Max Output

4,096 tokens

Cache Miss Cost

$0.80 per 1M tokens

Non-Reasoning Cost

$1.20 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus