Gemini 2.0 Flash Lite

Google

The most affordable Gemini model, optimized for cost efficiency and low latency. Supports a 1M token context window with multimodal inputs and function calling. No native code execution, web search, or thinking support. Best suited for high-volume simple tasks where cost is the primary concern.

Try Now

Capabilities

Tool Use

Image Input

PDF Input

Example Use Cases

Absolute lowest cost task

High-volume simple processing

Speed over intelligence

Technical Specifications

Context Window

1,048,576 tokens

Max Output

8,192 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.075

Non-Reasoning Output

$0.30

Tool Costs (per 1K calls)

Web Search

$35

Legacy

Made legacy on

Reason

Outdated model

Recommended Replacement

Gemini 3 Flash