
NVIDIA: Nemotron 3 Super (free)

NVIDIA

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model that activates just 12B parameters, targeting compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50% higher token generation throughput than leading open models. The model features a 1M-token context window for long-term agent coherence, cross-document reasoning, and multi-step task planning. Latent MoE enables calling 4 experts for the inference cost of only one, improving intelligence and generalization. Multi-environment RL training across 10+ environments delivers leading accuracy on benchmarks including AIME 2025, TerminalBench, and SWE-Bench Verified. Fully open with weights, datasets, and recipes under the NVIDIA Open License, Nemotron 3 Super allows easy customization and secure deployment anywhere, from workstation to cloud.
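To make the "activates only part of the model" idea concrete, here is a minimal sketch of generic top-k expert routing in plain Python. It is not NVIDIA's implementation and does not model Latent MoE, Mamba layers, or MTP. The expert functions, router scores, expert count, and the names `route_top_k` and `moe_forward` are all hypothetical, chosen only for illustration. The one figure taken from the description above is the 12B-of-120B active share.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so the selected experts' weights sum to 1.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]

def moe_forward(x, experts, router_scores, k):
    """Combine the outputs of only the selected experts; the rest never run."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_scores, k))

# Hypothetical toy experts: each one simply scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0)]
router_scores = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, 1.5, -0.5]  # made-up values

selected = route_top_k(router_scores, k=4)          # 4 of 8 experts are used
output = moe_forward(10.0, experts, router_scores, k=4)

# Active share implied by the figures quoted above: 12B of 120B parameters.
active_fraction = 12 / 120
print(f"experts used: {[i for i, _ in selected]}, active share: {active_fraction:.0%}")
```

The point of the sketch is only that a router picks a few experts per input and skips the rest, which is how a sparse model can hold many more parameters than it uses on any single step.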

Parameters: 120B
Context: 262,144 tokens on this listing
Modality: text->text
License: proprietary (NVIDIA Open License)
Open source: Yes


Other versions in the Nemotron family

NVIDIA: Llama 3.1 Nemotron 70B Instruct (70B)
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 (253B)
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 (49B)
NVIDIA: Nemotron 3 Nano 30B A3B (free) (30B)
NVIDIA: Nemotron Nano 12B 2 VL (free) (12B)