Gemini 2.5 Flash Lite

Google

Blazingly fast and incredibly affordable, without sacrificing capability. This lightweight model offers an extraordinary 1M token context window with multimodal support at breakthrough pricing. With configurable thinking and tool connectivity, it handles diverse tasks from quick queries to complex document analysis. The massive context window means you can process entire books, codebases, or datasets in a single request. Perfect for high-volume applications and cost-conscious projects that still need quality results.

Try Now

Capabilities

Thinking

Tool Use

Image Input

PDF Input

Example Use Cases

Ultra-low-cost task

High-volume batch processing

Simple task with very large context

Technical Specifications

Context Window

1,048,576 tokens

Max Output

65,536 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.10

Non-Reasoning Output

$0.40

Cache Read Input

$0.01

Tool Costs (per 1K calls)

Web Search

$35

Code Execution

$0

Legacy

Made legacy on

Reason

Much poorer instruction following than Gemini 3 Flash, while not being much cheaper.

Recommended Replacement

Gemini 3 Flash