LLM Name | Llama 3 3 Nemotron Super 49B V1 FP8 |
Repository 🤗 | https://huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1-FP8 |
Model Size | 49B |
Required VRAM | 52 GB |
Updated | 2025-07-30 |
Maintainer | nvidia |
Model Type | nemotron-nas |
Supported Languages | en |
Model Architecture | DeciLMForCausalLM |
License | other |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.48.3 |
Tokenizer Class | PreTrainedTokenizerFast |
Padding Token | <|eot_id|> |
Vocabulary Size | 128256 |
Torch Data Type | bfloat16 |
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
...lama 3 3 Nemotron Super 49B V1 | 128K / 100.2 GB | 21777 | 317 |
...ma 3 3 Nemotron Super 49B V1.5 | 128K / 100.2 GB | 1678 | 102 |
...a 3 3 Nemotron Super 49B GenRM | 128K / 99.7 GB | 199 | 11 |
...n Super 49B GenRM Multilingual | 128K / 99.7 GB | 232 | 6 |
...lama 3 3 Nemotron Super 49B V1 | 128K / 100.2 GB | 16 | 0 |
...3 3 Nemotron Super 49B V1 GGUF | 128K / 11.4 GB | 3187 | 8 |
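
The memory figures above look consistent with a simple bytes-per-parameter estimate: about 1 byte per parameter for FP8 weights (near the 52 GB Required VRAM) and about 2 bytes per parameter for bfloat16 weights (near the 100.2 GB listed for the unquantized variants). The listing does not say how its figures were computed, so the sketch below is only a rough cross-check. It assumes a nominal 49e9 parameters and decimal gigabytes, and the helper name `weight_memory_gb` is hypothetical.

```python
# Rough weight-memory estimate. These are back-of-envelope numbers,
# not measurements, and they ignore KV cache and runtime overhead.

# Assumed storage cost per parameter for each weight format.
BYTES_PER_PARAM = {"fp8": 1, "bfloat16": 2}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Nominal "49B" from the Model Size row; the exact count is an assumption.
PARAMS = 49e9

fp8_gb = weight_memory_gb(PARAMS, "fp8")
bf16_gb = weight_memory_gb(PARAMS, "bfloat16")

print(f"FP8 weights:      ~{fp8_gb:.0f} GB")
print(f"bfloat16 weights: ~{bf16_gb:.0f} GB")
```

The gap between these estimates and the listed values (52 GB and 100.2 GB) would plausibly be layers kept at higher precision, a parameter count slightly above 49e9, or loading overhead; the listing does not confirm which.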