Additional Notes |
|
LLM Name | Llama 30B 3bit Gr128 |
Repository ๐ค | https://huggingface.co/wcde/llama-30b-3bit-gr128 |
Model Size | 30b |
Required VRAM | 14 GB |
Updated | 2025-09-20 |
Maintainer | wcde |
Model Type | llama |
Model Files | |
Quantization Type | 3bit |
Model Architecture | LLaMAForCausalLM |
Transformers Version | 4.27.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Vocabulary Size | 32000 |
Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Llama 30B | 0K / 58.5 GB | 24 | 0 |
Llama 30B Int4 | 0K / 17 GB | 18 | 2 |
Llama 30B Int4 | 0K / 17 GB | 17 | 8 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐