GPT-4.1 Nano

openai

For tasks that demand low latency, GPT-4.1 Nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window. It's ideal for tasks like classification or autocompletion.

Try Now

Capabilities

Image Input

PDF Input

Tool Use

Example Use Cases

Classification and autocompletion

Ultra-low-latency tasks

High-throughput lightweight processing

Technical Specifications

Context Window

1,047,576 tokens

Max Output

32,768 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input

$0.10

Non-Reasoning Output

$0.40

Cache Read Input

$0.025

Tool Costs (per 1K calls)

Web Search

$15

Code Execution

$0.19

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus