| LLM Name | Llama 2 7B Chat Hf 4bits Q |
| Repository ๐ค | https://huggingface.co/AAProject/Llama-2-7b-chat-hf-4bits-Q |
| Model Size | 7b |
| Required VRAM | 4.2 GB |
| Updated | 2025-09-23 |
| Maintainer | AAProject |
| Model Type | llama |
| Model Files | |
| Model Architecture | LlamaForCausalLM |
| Context Length | 4096 |
| Model Max Length | 4096 |
| Transformers Version | 4.42.0.dev0 |
| Tokenizer Class | LlamaTokenizer |
| Vocabulary Size | 32000 |
| Torch Data Type | float16 |
Model |
Likes |
Downloads |
VRAM |
|---|---|---|---|
| Llama 2 7B Chat 4bit Gptq | 1 | 4 | 3 GB |
| ...lama 2 7B Chat Hf 4bit G64 HQQ | 3 | 11 | 4 GB |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| A6 L | 1024K / 16.1 GB | 201 | 0 |
| A3.4 | 1024K / 16.1 GB | 13 | 0 |
| A5.4 | 1024K / 16.1 GB | 12 | 0 |
| A2.4 | 1024K / 16.1 GB | 12 | 0 |
| M | 1024K / 16.1 GB | 127 | 0 |
| 157 | 1024K / 16.1 GB | 101 | 0 |
| 124 | 1024K / 16.1 GB | 93 | 0 |
| 162 | 1024K / 16.1 GB | 60 | 0 |
| 2 Very Sci Fi | 1024K / 16.1 GB | 317 | 0 |
| 118 | 1024K / 16.1 GB | 15 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐