Nemotron 3 Nano 30B A3B

NVIDIA

NVIDIA Nemotron 3 Nano 30B A3B is a small mixture-of-experts (MoE) language model designed for high compute efficiency and accuracy, aimed at developers building specialized agentic AI systems. The model is fully open, with open weights, datasets, and recipes, so developers can customize, optimize, and deploy it on their own infrastructure for maximum privacy and security.

Note: For the free endpoint, all prompts and outputs are logged to improve the provider's model and its products and services. Please do not upload any personal, confidential, or otherwise sensitive information. The free endpoint is for trial use only. Do not use it for production or business-critical systems.

Capabilities

Tool Use

Extended Thinking

Example Use Cases

Efficient agentic AI system

Budget MoE with tools and thinking

Customizable open NVIDIA model

Technical Specifications

Context Window

262,144 tokens

Max Output

131,072 tokens

Cache Miss Cost

$0.05 per 1M tokens

Non-Reasoning Cost

$0.20 per 1M tokens

Web Search Cost

$15 per 1K calls

Code Execution Cost

$0.19 per 1K calls
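
The figures above can be combined into a rough per-request estimate. The following is a minimal sketch, not an official billing formula. It assumes that "Cache Miss Cost" is the rate for uncached input tokens, that "Non-Reasoning Cost" is the rate for output tokens, and that input and output tokens together must fit inside the context window; none of these readings is confirmed by the listing. The names `Usage`, `fits_limits`, and `estimate_cost_usd` are hypothetical helpers introduced only for illustration.

```python
from dataclasses import dataclass

# Figures copied from the specification list above.
CONTEXT_WINDOW_TOKENS = 262_144
MAX_OUTPUT_TOKENS = 131_072
CACHE_MISS_USD_PER_1M = 0.05     # ASSUMPTION: applies to uncached input tokens
NON_REASONING_USD_PER_1M = 0.20  # ASSUMPTION: applies to output tokens
WEB_SEARCH_USD_PER_1K = 15.00
CODE_EXEC_USD_PER_1K = 0.19


@dataclass
class Usage:
    """Hypothetical container for one request's usage counts."""
    input_tokens: int
    output_tokens: int
    web_search_calls: int = 0
    code_exec_calls: int = 0


def fits_limits(usage: Usage) -> bool:
    """Check the request against the listed token limits.

    ASSUMPTION: input and output tokens share the context window.
    """
    within_output = usage.output_tokens <= MAX_OUTPUT_TOKENS
    within_context = (
        usage.input_tokens + usage.output_tokens <= CONTEXT_WINDOW_TOKENS
    )
    return within_output and within_context


def estimate_cost_usd(usage: Usage) -> float:
    """Sum token and tool-call charges using the listed rates."""
    token_cost = (
        usage.input_tokens * CACHE_MISS_USD_PER_1M / 1_000_000
        + usage.output_tokens * NON_REASONING_USD_PER_1M / 1_000_000
    )
    tool_cost = (
        usage.web_search_calls * WEB_SEARCH_USD_PER_1K / 1_000
        + usage.code_exec_calls * CODE_EXEC_USD_PER_1K / 1_000
    )
    return token_cost + tool_cost


if __name__ == "__main__":
    request = Usage(input_tokens=200_000, output_tokens=50_000, web_search_calls=10)
    print(fits_limits(request))
    print(round(estimate_cost_usd(request), 4))
```

Under these assumptions, a request with 200,000 input tokens, 50,000 output tokens, and 10 web searches comes to roughly $0.17, most of it from the web search calls rather than the tokens.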

⚠️ Legacy

Made legacy on

Reason

Untested

Recommended Replacement

Qwen3.5 Plus