NVIDIA Nemotron 3 Super 120B A12B FP8 is an open-source language model by NVIDIA. Key specs: 120B parameters, 128.4 GB required VRAM, 256K-token context (262,144 tokens), license: other.
| LLM Name | NVIDIA Nemotron 3 Super 120B A12B FP8 |
| Repository 🤗 | https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 |
| Model Size | 120B |
| Required VRAM | 128.4 GB |
| Updated | 2026-03-27 |
| Maintainer | nvidia |
| Model Type | nemotron_h |
| Supported Languages | en, fr, es, it, de, ja, zh |
| Model Architecture | NemotronHForCausalLM |
| License | other |
| Context Length | 262144 |
| Model Max Length | 262144 |
| Transformers Version | 4.57.6 |
| Tokenizer Class | PreTrainedTokenizerFast |
| Padding Token | <|im_end|> |
| Vocabulary Size | 131072 |
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...Nemotron 3 Super 120B A12B FP8 | 256K / 128.4 GB | 2514 | 9 |
| ...uper 120B A12B BF16 Heretic V2 | 256K / 241.4 GB | 248 | 0 |
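
The RAM figures in the table (128.4 GB for FP8, 241.4 GB for BF16) are roughly what a simple "parameters × bytes per parameter" estimate gives. The sketch below works through that arithmetic. It assumes a nominal count of 120 billion parameters taken from the model name, 1 byte per FP8 weight, 2 bytes per BF16 weight, and decimal gigabytes; the helper name `weight_memory_gb` is hypothetical. The gap between the estimate and the listed values is not explained by the source (it may come from layers kept at higher precision or a parameter count that is not exactly 120B), and the estimate excludes KV cache and activation memory.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of the model weights in decimal GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumption: nominal parameter count read off the "120B" in the model name.
NOMINAL_PARAMS = 120e9

# Assumption: FP8 stores one byte per weight, BF16 stores two.
fp8_gb = weight_memory_gb(NOMINAL_PARAMS, bytes_per_param=1)
bf16_gb = weight_memory_gb(NOMINAL_PARAMS, bytes_per_param=2)

# Figures listed on this page, for comparison only.
listed = {"FP8": 128.4, "BF16": 241.4}

for name, estimate in (("FP8", fp8_gb), ("BF16", bf16_gb)):
    print(f"{name}: estimated {estimate:.1f} GB, listed {listed[name]} GB")
```

Treat the result as a lower bound for planning: serving at the full 262,144-token context needs additional memory beyond the weights.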