Gemini 3 Pro

Google

Gemini 3 Pro is the most advanced Gemini model, built for multimodal reasoning and complex problem-solving. This preview model accepts text, images, audio, video, and documents, and is suited to sophisticated analytical tasks. Its 1M-token context window and enhanced reasoning capabilities let it work on problems that require deep understanding across multiple modalities. Choose it for demanding multimodal work, advanced research, or tasks that depend on strong visual and analytical reasoning.

Capabilities

Tool Use

PDF Input

Extended Thinking

Image Input

Example Use Cases

Advanced multimodal analysis (audio, video)

Most complex Gemini tasks

Cutting-edge multimodal research

Technical Specifications

Context Window: 1,000,000 tokens

Max Output: 64,000 tokens

Input Cost: $2 per 1M tokens

Output Cost: $12 per 1M tokens

Cache Read Cost: $0.20 per 1M tokens

Web Search Cost: $14 per 1K calls

Code Execution Cost: $0 per 1K calls
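
The listed prices can be combined into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost` and its parameters are hypothetical, and it assumes flat rates at every prompt size, that cached tokens are billed at the cache read rate instead of the input rate, and that `input_tokens` excludes cached tokens. Actual billing may differ.

```python
# Prices in USD, copied from the Technical Specifications above.
INPUT_PER_1M = 2.00        # per 1M input tokens
OUTPUT_PER_1M = 12.00      # per 1M output tokens
CACHE_READ_PER_1M = 0.20   # per 1M cached tokens read
WEB_SEARCH_PER_1K = 14.00  # per 1K web search calls
CODE_EXEC_PER_1K = 0.00    # per 1K code execution calls


def estimate_cost(input_tokens: int,
                  output_tokens: int,
                  cached_tokens: int = 0,
                  web_searches: int = 0,
                  code_executions: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat rates and that input_tokens does NOT include
    cached_tokens; both are assumptions, not documented behavior.
    """
    token_cost = (
        input_tokens / 1_000_000 * INPUT_PER_1M
        + output_tokens / 1_000_000 * OUTPUT_PER_1M
        + cached_tokens / 1_000_000 * CACHE_READ_PER_1M
    )
    call_cost = (
        web_searches / 1_000 * WEB_SEARCH_PER_1K
        + code_executions / 1_000 * CODE_EXEC_PER_1K
    )
    return token_cost + call_cost


if __name__ == "__main__":
    # 100K fresh input tokens, 5K output tokens, 500K cached tokens,
    # and 10 web searches.
    cost = estimate_cost(100_000, 5_000, cached_tokens=500_000, web_searches=10)
    print(f"${cost:.2f}")  # prints $0.50
```

In this example the input tokens cost $0.20, the output tokens $0.06, the cache reads $0.10, and the web searches $0.14. Output tokens cost six times as much as input tokens, so long responses tend to dominate the bill for generation-heavy workloads.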