A fast and cost-effective model with native tool use, code execution, and web search grounding. Supports a 1M token context window with multimodal inputs including text, images, audio, and video. Experimental thinking support enables configurable reasoning. Ideal for high-volume tasks that need broad capability at minimal cost.
Try Now1,048,576 tokens
8,192 tokens
$0.15
$0.60
$0.075
$35
$0