Gemini 2.5 Flash Lite

Name: Gemini 2.5 Flash Lite
Brand: Google

Google

Blazingly fast and incredibly affordable, without sacrificing capability. This lightweight model offers an extraordinary 1M token context window with multimodal support at breakthrough pricing. With configurable thinking and tool connectivity, it handles diverse tasks from quick queries to complex document analysis. The massive context window means you can process entire books, codebases, or datasets in a single request. Perfect for high-volume applications and cost-conscious projects that still need quality results.

Try Now

Capabilities

Thinking

Tool Use

Image Input

PDF Input

Technical Specifications

Context Window

1,048,576 tokens

Max Output

65,536 tokens

Pricing

Non-Priority Token Costs (per 1M tokens)

Cache Miss Input

$0.10

Non-Reasoning Output

$0.40

Cache Read Input

$0.01

Priority Token Costs (per 1M tokens)

Cache Miss Input

$0.18

Non-Reasoning Output

$0.72

Cache Read Input

$0.02

Tool Costs (per 1K calls)

Web Search

$35

Code Execution

Legacy

Made legacy on January 20, 2026

Reason

Much poorer instruction following than Gemini 3 Flash, while not being much cheaper.

Recommended Replacement

Gemini 3.1 Flash Lite