Nemotron 3 Super

Released 2026-03 · reasoning · 1.0M tokens · 9 benchmarks · Open weight

Editorial notes

Lanzado 11 marzo 2026 en GTC 2026. Hybrid Mamba-Transformer MoE 120B / 12B activos. LatentMoE architecture. 5x throughput NVFP4 Blackwell. Supera GPT-OSS-120B con +10% throughput/GPU. #1 DeepResearch Bench.

Spec sheet

Empresa: Nvidia
Pais: US
Tipo: reasoning
Release: 2026-03
Context: 1.0M tokens
Params total: 120B
Params active (MoE): 12B
Licencia: nvidia-open-model
Quants: BF16, Q8_0, Q5_K_M, Q4_K_M
Pricing (openrouter): $0.09/$0.45/M
Slug: nemotron-3-super

Quick install

2 tools

ollama.com

ollama run nemotron-3:super

Nota: ~70 GB VRAM Q4_K_M

docs.vllm.ai

vllm serve nvidia/Nemotron-3-Super --tensor-parallel-size 2

Benchmarks (9)

Reasoning 4

Coding 3

Math 1

AIME-2025

American Invitational Mathematics Examination 2025.

90.2

Instruction 1

Arena-Hard

Hard prompts del Arena - 500 tareas desafiantes.

73.9

Cite this model

BibTeX · APA

BibTeX

@misc{frontier-nemotron-3-super,
  title  = {Nemotron 3 Super},
  author = {{Nvidia}},
  year   = {2026},
  note   = {Frontier Benchmarks AI atlas. Accessed 2026-05-08},
  url    = {https://frontierbenchmarks.com/models/nemotron-3-super}
}

APA

Nvidia (2026). Nemotron 3 Super [Large language model]. Frontier Benchmarks AI. Retrieved 2026-05-08, from https://frontierbenchmarks.com/models/nemotron-3-super

Citation refleja la pagina del atlas, no el paper original del modelo. Para el paper, ve a la seccion "Recursos" arriba.

⚔️ Battle vs otro modelo ← Todos los modelos Mas de Nvidia Methodology