NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully open with open-weights, datasets and recipes so developers can easily customize, optimize, and deploy the model on their infrastructure for maximum privacy and security. Note: For the free endpoint, all prompts and output are logged to improve the provider's model and its product and services. Please do not upload any personal, confidential, or otherwise sensitive information. This is a trial use only. Do not use for production or business-critical systems.
Try NowEfficient agentic ai system
Budget moe with tools and thinking
Customizable open nvidia model
262,144 tokens
131,072 tokens
$0.05 per 1M tokens
$0.20 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls