LLM Name | Qwen3 30B A3B 128K GGUF |
Repository ๐ค | https://huggingface.co/unsloth/Qwen3-30B-A3B-128K-GGUF |
Base Model(s) | |
Model Size | 30b |
Required VRAM | 9 GB |
Updated | 2025-09-14 |
Maintainer | unsloth |
Model Type | qwen3_moe |
Model Files | |
Supported Languages | en |
GGUF Quantization | Yes |
Quantization Type | gguf|q2|q4_k|q5_k |
Model Architecture | Qwen3MoeForCausalLM |
License | apache-2.0 |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.51.3 |
Vocabulary Size | 151936 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Qwen3 30B A3B GGUF | 40K / 9 GB | 364578 | 245 |
...30B A3B Instruct 2507 MLX 4bit | 256K / 17.2 GB | 102052 | 5 |
...30B A3B Instruct 2507 MLX 8bit | 256K / 32.5 GB | 100137 | 3 |
...r 30B A3B Instruct 4bit Dwq V2 | 256K / 17.2 GB | 1675 | 6 |
...n3 Coder 30B A3B Instruct 4bit | 256K / 17.2 GB | 1956 | 9 |
...oder 30B A3B Instruct 4bit DWQ | 256K / 17.2 GB | 1538 | 5 |
...30B A3B Thinking 2507 MLX 4bit | 256K / 17.2 GB | 2100 | 2 |
...en3 30B A3B Instruct 2507 4bit | 256K / 61.2 GB | 1305 | 8 |
...en3 30B A3B Thinking 2507 4bit | 256K / 17.2 GB | 322 | 3 |
...en3 30B A3B Instruct 2507 8bit | 256K / 32.5 GB | 96 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐