Gemini 2.5 Flash Lite

google

Blazingly fast and incredibly affordable, without sacrificing capability. This lightweight model offers an extraordinary 1M token context window with multimodal support at breakthrough pricing. With configurable thinking and tool connectivity, it handles diverse tasks from quick queries to complex document analysis. The massive context window means you can process entire books, codebases, or datasets in a single request. Perfect for high-volume applications and cost-conscious projects that still need quality results.

Try Now

Capabilities

Tool Use

PDF Input

Extended Thinking

Image Input

Example Use Cases

Ultra-low-cost task

High-volume batch processing

Simple task with very large context

Technical Specifications

Context Window

1,048,576 tokens

Max Output

65,536 tokens

Cache Miss Cost

$0.10 per 1M tokens

Non-Reasoning Cost

$0.40 per 1M tokens

Cache Read Cost

$0.01 per 1M tokens

Web Search Cost

$35 per 1K calls

Code Execution Cost

$0 per 1K calls

⚠️ Legacy

Made legacy on

Reason

Much poorer instruction following than Gemini 3 Flash, while not being much cheaper.

Recommended Replacement

Gemini 3 Flash