Goliath 120B

alpindale

A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale.

Try Now

Example Use Cases

Creative or roleplay writing

Large merged model experiment

Llama-based prose generation

Technical Specifications

Context Window

6,144 tokens

Max Output

1,024 tokens

Cache Miss Cost

$3.75 per 1M tokens

Non-Reasoning Cost

$7.50 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus