Qwen3 Next 80B A3B Thinking NVFP4 is an open-source language model published by NVIDIA. Key specs: 80B parameters, 50.7 GB required VRAM, 256K context length, apache-2.0 license, LLM Explorer Score 0.35.
| Property | Value |
|---|---|
| LLM Name | Qwen3 Next 80B A3B Thinking NVFP4 |
| Repository 🤗 | https://huggingface.co/nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 |
| Base Model(s) | |
| Model Size | 80b |
| Required VRAM | 50.7 GB |
| Updated | 2026-03-27 |
| Maintainer | nvidia |
| Model Type | qwen3_next |
| Model Files | |
| Model Architecture | Qwen3NextForCausalLM |
| License | apache-2.0 |
| Context Length | 262144 |
| Model Max Length | 262144 |
| Transformers Version | 4.57.1 |
| Tokenizer Class | Qwen2Tokenizer |
| Padding Token | <|im_end|> |
| Vocabulary Size | 151936 |
| Errors | replace |
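
The size figures in the table above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes that "80b" means exactly 80 × 10^9 parameters and that "GB" means 10^9 bytes, neither of which the listing confirms, and the function name `bits_per_parameter` is made up for this example.

```python
# Figures copied from the table above.
CONTEXT_LENGTH = 262_144   # tokens
REQUIRED_VRAM_GB = 50.7    # as reported by the listing
PARAMS = 80e9              # assumption: "80b" = 80 * 10^9 parameters


def bits_per_parameter(vram_gb: float, params: float) -> float:
    """Average bits per parameter implied by a reported size.

    Assumes 1 GB = 10^9 bytes and that the figure covers weights only.
    """
    return vram_gb * 1e9 * 8 / params


# 262144 tokens is what the summary line rounds to "256K".
context_in_k = CONTEXT_LENGTH // 1024
print(f"{CONTEXT_LENGTH} tokens = {context_in_k}K")
print(f"{bits_per_parameter(REQUIRED_VRAM_GB, PARAMS):.2f} bits per parameter")
```

Under these assumptions the reported 50.7 GB works out to a little over 5 bits per parameter. That would be consistent with 4-bit weights plus scaling metadata and some tensors kept at higher precision, though the listing does not break the figure down.
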
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| Qwen3 Next 80B A3B Instruct | 256K / 162.7 GB | 1398721 | 855 |
| Qwen3 Next 80B A3B Instruct | 256K / 162.7 GB | 3126 | 73 |
| Qwen3 Next 80B A3B Thinking | 256K / 162.7 GB | 36 | 7 |
| Qwen3 Next 80B A3B Thinking | 256K / 162.7 GB | 35972 | 487 |
| ...wen3 Next 80B A3B Instruct FP8 | 256K / 81.8 GB | 190600 | 85 |
| ...n3 Next 80B A3B Instruct NVFP4 | 256K / 50.7 GB | 20206 | 37 |
| ...wen3 Next 80B A3B Thinking FP8 | 256K / 81.8 GB | 24473 | 52 |
| Qwen3 Next MoE | 256K / 0 GB | 15643 | 4 |
| ...ext 80B A3B Instruct Mxfp4 Mlx | 256K / 42 GB | 226 | 8 |
| ...0B A3B Instruct Int4 AutoRound | 256K / 42.3 GB | 206 | 9 |
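
To compare the footprints in the table above, the RAM figures can be expressed as a fraction of the 162.7 GB rows. This is a rough sketch: the values are copied from the table, the dictionary keys are just the format suffixes visible in the row names, and treating 162.7 GB as the "baseline" is an assumption made for illustration rather than something the listing states.

```python
# Context / RAM figures copied from the table above, keyed by the format
# suffix that appears in each row's name.
BASELINE_GB = 162.7  # rows whose names carry no format suffix (assumed baseline)
ram_gb = {
    "FP8": 81.8,
    "NVFP4": 50.7,
    "Mxfp4 Mlx": 42.0,
    "Int4 AutoRound": 42.3,
}

# Fraction of the baseline footprint for each variant.
relative_size = {name: gb / BASELINE_GB for name, gb in ram_gb.items()}

# Print smallest first.
for name, fraction in sorted(relative_size.items(), key=lambda kv: kv[1]):
    print(f"{name:<15} {ram_gb[name]:>6.1f} GB  {fraction:.0%} of baseline")
```

By this measure the FP8 rows need about half the memory of the 162.7 GB rows, and the NVFP4 rows a bit under a third.
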