| Model Type |
| |||||||||||||||||||||
| Use Cases |
| |||||||||||||||||||||
| Additional Notes |
| |||||||||||||||||||||
| Supported Languages |
| |||||||||||||||||||||
| Training Details |
| |||||||||||||||||||||
| Safety Evaluation |
| |||||||||||||||||||||
| Responsible Ai Considerations |
| |||||||||||||||||||||
| Input Output |
|
| LLM Name | Llama 2 7B Sharded |
| Repository ๐ค | https://huggingface.co/Xilabs/Llama-2-7b-Sharded |
| Model Size | 7b |
| Required VRAM | 13.4 GB |
| Updated | 2025-09-23 |
| Maintainer | Xilabs |
| Model Type | llama |
| Model Files | |
| Supported Languages | en |
| Model Architecture | LlamaForCausalLM |
| License | other |
| Context Length | 4096 |
| Model Max Length | 4096 |
| Transformers Version | 4.32.0.dev0 |
| Tokenizer Class | LlamaTokenizer |
| Beginning of Sentence Token | <s> |
| End of Sentence Token | </s> |
| Unk Token | <unk> |
| Vocabulary Size | 32000 |
| Torch Data Type | bfloat16 |
Model |
Likes |
Downloads |
VRAM |
|---|---|---|---|
| Llama 2 7B Bf16 Sharded | 74 | 1686 | 13 GB |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| A6 L | 1024K / 16.1 GB | 201 | 0 |
| A3.4 | 1024K / 16.1 GB | 13 | 0 |
| A5.4 | 1024K / 16.1 GB | 12 | 0 |
| A2.4 | 1024K / 16.1 GB | 12 | 0 |
| M | 1024K / 16.1 GB | 127 | 0 |
| 157 | 1024K / 16.1 GB | 101 | 0 |
| 124 | 1024K / 16.1 GB | 93 | 0 |
| 162 | 1024K / 16.1 GB | 60 | 0 |
| 2 Very Sci Fi | 1024K / 16.1 GB | 317 | 0 |
| 118 | 1024K / 16.1 GB | 15 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐