Nemotron 3 Super
Released 2026-03 · reasoning · 1.0M tokens · 9 benchmarks · Open weight
Editorial notes
Lanzado 11 marzo 2026 en GTC 2026. Hybrid Mamba-Transformer MoE 120B / 12B activos. LatentMoE architecture. 5x throughput NVFP4 Blackwell. Supera GPT-OSS-120B con +10% throughput/GPU. #1 DeepResearch Bench.
Spec sheet
- Empresa
- Nvidia
- Pais
- US
- Tipo
- reasoning
- Release
- 2026-03
- Context
- 1.0M tokens
- Params total
- 120B
- Params active (MoE)
- 12B
- Licencia
- nvidia-open-model
- Quants
- BF16, Q8_0, Q5_K_M, Q4_K_M
- Pricing (openrouter)
- $0.09/$0.45/M
- Slug
- nemotron-3-super
Quick install
2 toolsollama run nemotron-3:super Nota: ~70 GB VRAM Q4_K_M
vllm serve nvidia/Nemotron-3-Super --tensor-parallel-size 2 Benchmarks (9)
Reasoning 4
- 86.0MMLUMassive Multitask Language Understanding - 57 materias academicas, ~16K pregunta
- 83.3MMLU-ProMMLU mejorado con preguntas mas dificiles y 10 opciones de respuesta.
- 79.4GPQA-DiamondGraduate-level Physics, Chemistry, Biology - preguntas de nivel doctoral.
- 17.4Humanitys-Last-ExamEl benchmark mas dificil conocido - problemas academicos novedosos.
Coding 3
Cite this model
BibTeX · APA
BibTeX
@misc{frontier-nemotron-3-super,
title = {Nemotron 3 Super},
author = {{Nvidia}},
year = {2026},
note = {Frontier Benchmarks AI atlas. Accessed 2026-05-08},
url = {https://frontierbenchmarks.com/models/nemotron-3-super}
} APA
Nvidia (2026). Nemotron 3 Super [Large language model]. Frontier Benchmarks AI. Retrieved 2026-05-08, from https://frontierbenchmarks.com/models/nemotron-3-super
Citation refleja la pagina del atlas, no el paper original del modelo. Para el paper, ve a la seccion "Recursos" arriba.