LLaMA2 13B Estopia is an open-source language model by KoboldAI. Features: 13b LLM, VRAM: 26.1GB, Context: 4K, License: cc-by-nc-4.0, Quantized, HF Score: 57.2, LLM Explorer Score: 0.14, Arc: 62.3, HellaSwag: 82.5, MMLU: 55.1, TruthfulQA: 54.1, WinoGrande: 75.8, GSM8K: 13.4.
| Model Type |
| ||||||||||||
| Use Cases |
| ||||||||||||
| Additional Notes |
| ||||||||||||
| Training Details |
| ||||||||||||
| Input Output |
|
| LLM Name | LLaMA2 13B Estopia |
| Repository 🤗 | https://huggingface.co/KoboldAI/LLaMA2-13B-Estopia |
| Base Model(s) | |
| Model Size | 13b |
| Required VRAM | 26.1 GB |
| Updated | 2026-04-20 |
| Maintainer | KoboldAI |
| Model Type | llama |
| Model Files | |
| Quantization Type | fp16 |
| Model Architecture | LlamaForCausalLM |
| License | cc-by-nc-4.0 |
| Context Length | 4096 |
| Model Max Length | 4096 |
| Transformers Version | 4.36.1 |
| Tokenizer Class | LlamaTokenizer |
| Vocabulary Size | 32000 |
| Torch Data Type | bfloat16 |
Model |
Likes |
Downloads |
VRAM |
|---|---|---|---|
| LLaMA2 13B Estopia GGUF | 8 | 207 | 4 GB |
| LLaMA2 13B Estopia GPTQ | 8 | 11 | 7 GB |
| LLaMA2 13B Estopia AWQ | 2 | 14 | 7 GB |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| Llama13b 32K Illumeet Finetune | 32K / 26 GB | 5 | 0 |
| ...Maid V3 13B 32K 6.0bpw H6 EXL2 | 32K / 10 GB | 4 | 1 |
| ...Maid V3 13B 32K 8.0bpw H8 EXL2 | 32K / 13.2 GB | 3 | 1 |
| WhiteRabbitNeo 13B V1 | 16K / 26 GB | 2921 | 450 |
| CodeLlama 13B Python Fp16 | 16K / 26 GB | 1447 | 25 |
| CodeLlama 13B Instruct Fp16 | 16K / 26 GB | 903 | 28 |
| CodeLlama 13B Fp16 | 16K / 26 GB | 38 | 67 |
| ...Llama 13B Instruct Hf 4bit MLX | 16K / 7.8 GB | 346 | 3 |
| Codellama 13B Bnb 4bit | 16K / 7.2 GB | 88 | 5 |
| Airophin 13B Pntk 16K Fp16 | 16K / 26 GB | 380 | 4 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟