What are the hardware requirements for NVIDIA Nemotron 3 Super 120B A12B FP8?

NVIDIA Nemotron 3 Super 120B A12B FP8 requires approximately 128.4 GB of VRAM and supports a context window of 256K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed NVIDIA Nemotron 3 Super 120B A12B FP8 and how large is it?

NVIDIA Nemotron 3 Super 120B A12B FP8 is developed by unsloth, a model with 120b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

How does NVIDIA Nemotron 3 Super 120B A12B FP8 perform on standard benchmarks?

NVIDIA Nemotron 3 Super 120B A12B FP8 has the following published scores: LMSYS ELO 1365. Compare against reference models on this page or on the LLM Explorer leaderboards.

NVIDIA Nemotron 3 Super 120B A12B FP8 by unsloth — VRAM 128.4GB, 256K context

Name: NVIDIA Nemotron 3 Super 120B A12B FP8
Author: unsloth

NVIDIA Nemotron 3 Super 120B A12B FP8 is an open-source language model by unsloth. Features: 120b LLM, VRAM: 128.4GB, Context: 256K, License: other, LLM Explorer Score: 0.43, ELO: 1365.

Arxiv:2512.20848 Arxiv:2512.20856 Base model:nvidia/nvidia-nemot... Base model:quantized:nvidia/nv... Conversational Custom code Dataset:nvidia/nemotron-post-t... Dataset:nvidia/nemotron-pre-tr... De En Endpoints compatible Es Fr It Ja Latent-moe Modelopt Mtp Nemotron-3 Nemotron h Nvidia Pytorch Region:us Safetensors Sharded Tensorflow Unsloth Zh

Model Card on HF 🤗: https://huggingface.co/unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-FP8

NVIDIA Nemotron 3 Super 120B A12B FP8 Benchmarks

LMSys ELO: 1365 vs 1272 (so35)^7.3%

LLME Score: 0.42677

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

NVIDIA Nemotron 3 Super 120B A12B FP8 (unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-FP8)

🌟 Advertise your project 🚀

NVIDIA Nemotron 3 Super 120B A12B FP8 Parameters and Internals

LLM Name	NVIDIA Nemotron 3 Super 120B A12B FP8
Repository 🤗	https://huggingface.co/unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Base Model(s)	...emotron 3 Super 120B A12B BF16 nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
Model Size	120b
Required VRAM	128.4 GB
Updated	2026-05-08
Maintainer	unsloth
Model Type	nemotron_h
Model Files	5.0 GB: 1-of-26 5.0 GB: 2-of-26 5.0 GB: 3-of-26 5.0 GB: 4-of-26 5.0 GB: 5-of-26 5.0 GB: 6-of-26 5.0 GB: 7-of-26 5.0 GB: 8-of-26 5.0 GB: 9-of-26 5.0 GB: 10-of-26 5.0 GB: 11-of-26 5.0 GB: 12-of-26 5.0 GB: 13-of-26 5.0 GB: 14-of-26 5.0 GB: 15-of-26 5.0 GB: 16-of-26 5.0 GB: 17-of-26 5.0 GB: 18-of-26 5.0 GB: 19-of-26 5.0 GB: 20-of-26 5.0 GB: 21-of-26 5.0 GB: 22-of-26 5.0 GB: 23-of-26 5.0 GB: 24-of-26 5.0 GB: 25-of-26 3.4 GB: 26-of-26
Supported Languages	en fr es it de ja zh
Model Architecture	NemotronHForCausalLM
License	other
Context Length	262144
Model Max Length	262144
Transformers Version	4.57.6
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<SPECIAL_999>
Vocabulary Size	131072
Torch Data Type	bfloat16

Best Alternatives to NVIDIA Nemotron 3 Super 120B A12B FP8

Best Alternatives	Context / RAM	Downloads	Likes
...on 3 Super 120B A12B Base BF16	1024K / 209.5 GB	22678	30
...emotron 3 Super 120B A12B BF16	256K / 194.6 GB	729273	344
...motron 3 Super 120B A12B NVFP4	256K / 80.3 GB	894238	290
...Nemotron 3 Super 120B A12B FP8	256K / 128.4 GB	370440	244
...DIA Nemotron 3 Super 120B A12B	256K / 214.5 GB	1723	3
...motron 3 Super 120B A12B NVFP4	256K / 80.3 GB	46082	22
... Super 64B A12B Math REAP BF16	256K / 128.6 GB	657	1
...uper 120B A12B BF16 Heretic V2	256K / 241.4 GB	2190	3
...20B A12B BF16 REAP 50pct Draft	256K / 128.5 GB	204	6
...emotron 3 Super 120B A12B 5bit	256K / 83.1 GB	2880	2

Note: green Score (e.g. "73.2") means that the model is better than unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-FP8.

Rank the NVIDIA Nemotron 3 Super 120B A12B FP8 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 53640 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

NVIDIA Nemotron 3 Super 120B A12B FP8 by unsloth

» All LLMs » unsloth » NVIDIA Nemotron 3 Super 120B A12B FP8 URL Share it on

NVIDIA Nemotron 3 Super 120B A12B FP8 Benchmarks

NVIDIA Nemotron 3 Super 120B A12B FP8 Parameters and Internals

Best Alternatives to NVIDIA Nemotron 3 Super 120B A12B FP8

Rank the NVIDIA Nemotron 3 Super 120B A12B FP8 Capabilities

What open-source LLMs or SLMs are you in search of? 53640 in total.