What are the hardware requirements for NVIDIA Nemotron 3 Super 120B A12B FP8?

NVIDIA Nemotron 3 Super 120B A12B FP8 requires approximately 128.4 GB of VRAM and supports a context window of 256K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed NVIDIA Nemotron 3 Super 120B A12B FP8 and how large is it?

NVIDIA Nemotron 3 Super 120B A12B FP8 is developed by nvidia, a model with 120b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate NVIDIA Nemotron 3 Super 120B A12B FP8?

NVIDIA Nemotron 3 Super 120B A12B FP8 is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

NVIDIA Nemotron 3 Super 120B A12B FP8 by nvidia — VRAM 128.4GB, 256K context

Name: NVIDIA Nemotron 3 Super 120B A12B FP8
Author: nvidia

NVIDIA Nemotron 3 Super 120B A12B FP8 is an open-source language model by nvidia. Features: 120b LLM, VRAM: 128.4GB, Context: 256K, License: other, LLM Explorer Score: 0.44.

Arxiv:2512.20848 Arxiv:2512.20856 Conversational Custom code Dataset:nvidia/nemotron-post-t... Dataset:nvidia/nemotron-pre-tr... De Deploy:azure En Endpoints compatible Es Fr It Ja Latent-moe Modelopt Mtp Nemotron-3 Nemotron h Nvidia Pytorch Region:us Safetensors Sharded Tensorflow Zh

Model Card on HF 🤗: https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8

NVIDIA Nemotron 3 Super 120B A12B FP8 Benchmarks

LLME Score: 0.44381

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

NVIDIA Nemotron 3 Super 120B A12B FP8 (nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8)

🌟 Advertise your project 🚀

NVIDIA Nemotron 3 Super 120B A12B FP8 Parameters and Internals

LLM Name	NVIDIA Nemotron 3 Super 120B A12B FP8
Repository 🤗	https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Model Size	120b
Required VRAM	128.4 GB
Updated	2026-05-11
Maintainer	nvidia
Model Type	nemotron_h
Model Files	5.0 GB: 1-of-26 5.0 GB: 2-of-26 5.0 GB: 3-of-26 5.0 GB: 4-of-26 5.0 GB: 5-of-26 5.0 GB: 6-of-26 5.0 GB: 7-of-26 5.0 GB: 8-of-26 5.0 GB: 9-of-26 5.0 GB: 10-of-26 5.0 GB: 11-of-26 5.0 GB: 12-of-26 5.0 GB: 13-of-26 5.0 GB: 14-of-26 5.0 GB: 15-of-26 5.0 GB: 16-of-26 5.0 GB: 17-of-26 5.0 GB: 18-of-26 5.0 GB: 19-of-26 5.0 GB: 20-of-26 5.0 GB: 21-of-26 5.0 GB: 22-of-26 5.0 GB: 23-of-26 5.0 GB: 24-of-26 5.0 GB: 25-of-26 3.4 GB: 26-of-26
Supported Languages	en fr es it de ja zh
Model Architecture	NemotronHForCausalLM
License	other
Context Length	262144
Model Max Length	262144
Transformers Version	4.57.6
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<\|im_end\|>
Vocabulary Size	131072

Best Alternatives to NVIDIA Nemotron 3 Super 120B A12B FP8

Best Alternatives	Context / RAM	Downloads	Likes
...on 3 Super 120B A12B Base BF16	1024K / 209.5 GB	22678	30
...emotron 3 Super 120B A12B BF16	256K / 194.6 GB	729273	344
...motron 3 Super 120B A12B NVFP4	256K / 80.3 GB	894238	290
...Nemotron 3 Super 120B A12B FP8	256K / 128.4 GB	1486	9
...DIA Nemotron 3 Super 120B A12B	256K / 214.5 GB	1723	3
...motron 3 Super 120B A12B NVFP4	256K / 80.3 GB	46082	22
... Super 64B A12B Math REAP BF16	256K / 128.6 GB	657	1
...uper 120B A12B BF16 Heretic V2	256K / 241.4 GB	2190	3
...20B A12B BF16 REAP 50pct Draft	256K / 128.5 GB	204	6
...emotron 3 Super 120B A12B 5bit	256K / 83.1 GB	2880	2

Note: green Score (e.g. "73.2") means that the model is better than nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8.

Rank the NVIDIA Nemotron 3 Super 120B A12B FP8 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 53640 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

NVIDIA Nemotron 3 Super 120B A12B FP8 by nvidia

» All LLMs » nvidia » NVIDIA Nemotron 3 Super 120B A12B FP8 URL Share it on

NVIDIA Nemotron 3 Super 120B A12B FP8 Benchmarks

NVIDIA Nemotron 3 Super 120B A12B FP8 Parameters and Internals

Best Alternatives to NVIDIA Nemotron 3 Super 120B A12B FP8

Rank the NVIDIA Nemotron 3 Super 120B A12B FP8 Capabilities

What open-source LLMs or SLMs are you in search of? 53640 in total.