Llama 2 13B Fp16 is an open-source language model by TheBloke. Features: 13b LLM, VRAM: 26GB, Context: 4K, Quantized, HF Score: 53.7, LLM Explorer Score: 0.12, Arc: 59.3, HellaSwag: 82.2, MMLU: 55.7, TruthfulQA: 37.4, WinoGrande: 76.6, GSM8K: 10.8.
| Model Type |
| |||||||||||||||||||||
| Use Cases |
| |||||||||||||||||||||
| Supported Languages |
| |||||||||||||||||||||
| Training Details |
| |||||||||||||||||||||
| Responsible Ai Considerations |
| |||||||||||||||||||||
| Input Output |
|
| LLM Name | Llama 2 13B Fp16 |
| Repository 🤗 | https://huggingface.co/TheBloke/Llama-2-13B-fp16 |
| Base Model(s) | |
| Model Size | 13b |
| Required VRAM | 26 GB |
| Updated | 2026-04-13 |
| Maintainer | TheBloke |
| Model Type | llama |
| Model Files | |
| Supported Languages | en |
| Quantization Type | fp16 |
| Model Architecture | LlamaForCausalLM |
| Context Length | 4096 |
| Model Max Length | 4096 |
| Transformers Version | 4.30.2 |
| Tokenizer Class | LlamaTokenizer |
| Beginning of Sentence Token | <s> |
| End of Sentence Token | </s> |
| Unk Token | <unk> |
| Vocabulary Size | 32000 |
| Torch Data Type | float16 |
Model |
Likes |
Downloads |
VRAM |
|---|---|---|---|
| LLaMA2 13B Estopia | 21 | 80 | 26 GB |
| ...MA2 13B Estopia 3.0bpw H6 EXL2 | 1 | 5 | 5 GB |
| ...MA2 13B Estopia 8.0bpw H8 EXL2 | 1 | 2 | 13 GB |
| Storytelling V2 13B Lora | 7 | 3 | 0 GB |
| Lmg V3 Lora | 2 | 10 | 0 GB |
| Storytelling V1 13B Lora | 6 | 2 | 0 GB |
| Minotaur Llama2 13B Qlora | 4 | 49 | 1 GB |
| Minotaur Llama2 13B Qlora | 4 | 3 | 1 GB |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| Llama13b 32K Illumeet Finetune | 32K / 26 GB | 5 | 0 |
| ...Maid V3 13B 32K 6.0bpw H6 EXL2 | 32K / 10 GB | 4 | 1 |
| ...Maid V3 13B 32K 8.0bpw H8 EXL2 | 32K / 13.2 GB | 3 | 1 |
| WhiteRabbitNeo 13B V1 | 16K / 26 GB | 2921 | 450 |
| CodeLlama 13B Python Fp16 | 16K / 26 GB | 1447 | 25 |
| CodeLlama 13B Instruct Fp16 | 16K / 26 GB | 903 | 28 |
| CodeLlama 13B Fp16 | 16K / 26 GB | 38 | 67 |
| ...Llama 13B Instruct Hf 4bit MLX | 16K / 7.8 GB | 346 | 3 |
| Codellama 13B Bnb 4bit | 16K / 7.2 GB | 88 | 5 |
| Airophin 13B Pntk 16K Fp16 | 16K / 26 GB | 380 | 4 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟