For tasks that demand low latency, GPT-4.1 Nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window. It's ideal for tasks like classification or autocompletion.
Try NowClassification and autocompletion
Ultra-low-latency tasks
High-throughput lightweight processing
1,047,576 tokens
32,768 tokens
$0.10
$0.40
$0.025
$15
$0.19