| Additional Notes |
|
| LLM Name | Llama 7B 4bit Act |
| Repository ๐ค | https://huggingface.co/wcde/llama-7b-4bit-act |
| Base Model(s) | |
| Model Size | 7b |
| Required VRAM | 3.8 GB |
| Updated | 2025-10-31 |
| Maintainer | wcde |
| Model Type | llama |
| Model Files | |
| Quantization Type | 4bit |
| Model Architecture | LLaMAForCausalLM |
| Transformers Version | 4.27.0.dev0 |
| Tokenizer Class | LlamaTokenizer |
| Vocabulary Size | 32000 |
| Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| Llama 7B Onnx Merged Fp16 | 2K / GB | 13 | 7 |
| Alpaca 7B Native 4bit | 0K / 4.5 GB | 7 | 4 |
| Alpaca Native 4bit | 0K / 4.5 GB | 12 | 58 |
| Llama 7B 4bit Gr128 | 0K / 4 GB | 15 | 4 |
| Swallow 7B GPTQ | 4K / 4.1 GB | 1 | 1 |
| Honest Llama2 Chat 7B | 2K / 13.5 GB | 90 | 9 |
| Llama 7B Onnx Merged Fp32 | 2K / GB | 14 | 2 |
| Chatdoctor | 0K / 27 GB | 775 | 12 |
| ...xplore LM Ext 7B Brainstorming | 0K / 27 GB | 5 | 2 |
| Explore LM 7B Rewriting | 0K / 27 GB | 12 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐