LLM Name | Qwen3 30B A3B |
Repository 🤗 | https://huggingface.co/Qwen/Qwen3-30B-A3B |
Base Model(s) | |
Model Size | 30B |
Required VRAM | 61.1 GB |
Updated | 2025-07-29 |
Maintainer | Qwen |
Model Type | qwen3_moe |
Model Files | |
Model Architecture | Qwen3MoeForCausalLM |
License | apache-2.0 |
Context Length | 40960 |
Model Max Length | 40960 |
Transformers Version | 4.51.0 |
Tokenizer Class | Qwen2Tokenizer |
Padding Token | <|endoftext|> |
Vocabulary Size | 151936 |
Torch Data Type | bfloat16 |
Errors | replace |
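
The Required VRAM figure above is close to what the weights alone occupy at the listed Torch Data Type. The sketch below reproduces that arithmetic. It assumes a total of about 30.5 billion parameters, a figure that does not appear in the table, and the helper name `estimate_weight_memory_gb` is hypothetical. It counts weights only, so activations, the KV cache, and framework overhead would come on top.

```python
def estimate_weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 1e9


# Assumption: roughly 30.5 billion total parameters (not stated in the table).
ASSUMED_TOTAL_PARAMS = 30.5e9

# bfloat16 stores each weight in 16 bits; a 4-bit quantization uses about 4 bits
# per weight, ignoring the scales and zero points that real formats also store.
bf16_gb = estimate_weight_memory_gb(ASSUMED_TOTAL_PARAMS, 16)
int4_gb = estimate_weight_memory_gb(ASSUMED_TOTAL_PARAMS, 4)

print(f"bfloat16 weights: {bf16_gb:.1f} GB")
print(f"4-bit weights:    {int4_gb:.2f} GB")
```

Under that assumption the bfloat16 estimate comes to about 61 GB, in line with the 61.1 GB listed, and the 4-bit estimate of roughly 15 GB sits slightly below the 16 to 17 GB reported for the 4-bit variants, which is plausible given per-group quantization metadata.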
Model | Likes | Downloads | VRAM |
---|---|---|---|
Qwen3 30B A3B GGUF | 234 | 127426 | 9 GB |
Qwen3 30B A3B MLX 4bit | 24 | 54679 | 17 GB |
Qwen3 30B A3B GPTQ Int4 | 19 | 29419 | 16 GB |
Qwen3 30B A3B 128K GGUF | 53 | 8940 | 9 GB |
Qwen3 30B A3B AWQ | 12 | 4483 | 16 GB |
Qwen3 30B A3B AWQ | 12 | 3774 | 16 GB |
...wen3 30B A3B 4bit DWQ 10072025 | 2 | 345 | 17 GB |
Qwen3 30B A3B 4bit DWQ | 27 | 1605 | 17 GB |
Qwen3 30B A3B 4bit DWQ 053125 | 6 | 1138 | 17 GB |
Qwen3 30B A3B 3bit | 2 | 222 | 13 GB |
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
...0B A6B 16 Extreme 128K Context | 128K / 61.1 GB | 21 | 8 |
Qwen3 30B A3B FP8 | 40K / 32.5 GB | 184702 | 74 |
...en3 30B A3B ArliAI RpR V4 Fast | 40K / 61.1 GB | 1194 | 17 |
...d Qwen3 30B A3B Abliterated V2 | 40K / 61.1 GB | 1090 | 14 |
Qwen3 16B A3B | 40K / 32.1 GB | 563 | 82 |
Qwen3 30B A3B Abliterated | 40K / 61.1 GB | 1895 | 34 |
Qwen3 30B A6B 16 Extreme | 40K / 61.1 GB | 886 | 54 |
...en3 30B A3B Abliterated Erotic | 40K / 61.1 GB | 849 | 21 |
Qwen3 30B A3B MLX Bf16 | 40K / 61.2 GB | 470 | 6 |
Qode 30B | 40K / 61.1 GB | 34 | 0 |