Granite 4.0 Micro

IBM

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long context tool calling.

Try Now

Technical Specifications

Context Window

131,000 tokens

Max Output

131,072 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.017

Non-Reasoning Output

$0.11

Legacy

Made legacy on

Reason

Small IBM Granite model; limited adoption

Recommended Replacement

Qwen3.6 Plus