NVIDIA Nemotron 3 Nano 30B A3B MLX BF16 by mlx-community

 »  All LLMs  »  mlx-community  »  NVIDIA Nemotron 3 Nano 30B A3B MLX BF16   URL Share it on

NVIDIA Nemotron 3 Nano 30B A3B MLX BF16 is an open-source language model by mlx-community. Features: 30b LLM, VRAM: 62.9GB, Context: 256K, License: other, Instruction-Based, LLM Explorer Score: 0.25.

Base model:finetune:nvidia/nvi... Base model:nvidia/nvidia-nemot...   Conversational Dataset:nvidia/nemotron-3-nano... Dataset:nvidia/nemotron-agenti... Dataset:nvidia/nemotron-cc-cod... Dataset:nvidia/nemotron-cc-mat...   Dataset:nvidia/nemotron-cc-v2 Dataset:nvidia/nemotron-cc-v2.... Dataset:nvidia/nemotron-compet... Dataset:nvidia/nemotron-instru... Dataset:nvidia/nemotron-math-p... Dataset:nvidia/nemotron-math-v... Dataset:nvidia/nemotron-pretra... Dataset:nvidia/nemotron-pretra... Dataset:nvidia/nemotron-pretra... Dataset:nvidia/nemotron-pretra... Dataset:nvidia/nemotron-pretra... Dataset:nvidia/nemotron-scienc...   De   En   Es   Fr   Instruct   It   Ja   Mlx   Nvidia   Pytorch   Region:us   Safetensors   Sharded   Tensorflow

NVIDIA Nemotron 3 Nano 30B A3B MLX BF16 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

NVIDIA Nemotron 3 Nano 30B A3B MLX BF16 Parameters and Internals

LLM NameNVIDIA Nemotron 3 Nano 30B A3B MLX BF16
Repository 🤗https://huggingface.co/mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-BF16 
Base Model(s)  ...A Nemotron 3 Nano 30B A3B BF16   nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Model Size30b
Required VRAM62.9 GB
Updated2026-06-06
Maintainermlx-community
Model Typenemotron_h
Instruction-BasedYes
Model Files  4.7 GB: 1-of-13   4.1 GB: 2-of-13   5.4 GB: 3-of-13   5.4 GB: 4-of-13   5.3 GB: 5-of-13   5.3 GB: 6-of-13   5.3 GB: 7-of-13   4.1 GB: 8-of-13   5.3 GB: 9-of-13   4.1 GB: 10-of-13   5.3 GB: 11-of-13   5.3 GB: 12-of-13   3.3 GB: 13-of-13
Supported Languagesen es fr de ja it
Model ArchitectureNemotronHForCausalLM
Licenseother
Context Length262144
Model Max Length262144
Transformers Version4.55.4
Tokenizer ClassTokenizersBackend
Vocabulary Size131072
Torch Data Typebfloat16

Quantized Models of the NVIDIA Nemotron 3 Nano 30B A3B MLX BF16

Model
Likes
Downloads
VRAM
...tron 3 Nano 30B A3B OptiQ 4bit259122 GB

Best Alternatives to NVIDIA Nemotron 3 Nano 30B A3B MLX BF16

Best Alternatives
Context / RAM
Downloads
Likes
...A Nemotron 3 Nano 30B A3B BF16256K / 63.2 GB1645889752
... Nemotron 3 Nano 30B A3B NVFP4256K / 19.3 GB14098
...IA Nemotron 3 Nano 30B A3B FP8256K / 32.7 GB889958344
... Nemotron 3 Nano 30B A3B NVFP4256K / 19.3 GB532640151
Nemotron 3 Nano 30B A3B256K / 63.2 GB17431514
...on 3 Nano 30B A3B BF16 Heretic256K / 63.2 GB15332
... Nemotron 3 Nano 30B A3B NVFP4256K / 19.3 GB252411
...otron 3 Nano 30B A3B MLX MXFP4256K / 16.8 GB10413
Nemotron 3 Nano 30B A3B FP8256K / 32.7 GB1107
...motron 3 Nano 30B A3B MLX 6Bit256K / 25.8 GB1382
Note: green Score (e.g. "73.2") means that the model is better than mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-BF16.

Rank the NVIDIA Nemotron 3 Nano 30B A3B MLX BF16 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 54454 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a