| LLM Name | Llama Finetune 16bit Wiki 250 Ver2 GPTQ |
| Repository ๐ค | https://huggingface.co/obamaTeo/llama-finetune-16bit-wiki-250-ver2-GPTQ |
| Model Size | 2b |
| Required VRAM | 5.8 GB |
| Updated | 2025-08-18 |
| Maintainer | obamaTeo |
| Model Type | llama |
| Model Files | |
| GPTQ Quantization | Yes |
| Quantization Type | gptq|16bit |
| Model Architecture | LlamaForCausalLM |
| Context Length | 8192 |
| Model Max Length | 8192 |
| Transformers Version | 4.41.2 |
| Tokenizer Class | PreTrainedTokenizerFast |
| Padding Token | <|reserved_special_token_250|> |
| Vocabulary Size | 128256 |
| Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| EPFL TA Meister GPTQ 4bit | 8K / 5.8 GB | 5 | 0 |
| Llama 3 Teachtechai Gptq | 8K / 5.8 GB | 7 | 0 |
| Mythalion Kimiko V2 GPTQ | 4K / 7.3 GB | 5 | 3 |
| Stheno V2 Delta GPTQ | 4K / 7.3 GB | 9 | 3 |
| ...yn Education Corpus Qa V2 GPTQ | 4K / 7.3 GB | 27 | 3 |
| Athena V4 GPTQ | 4K / 7.3 GB | 10 | 8 |
| Athena V3 GPTQ | 4K / 7.3 GB | 15 | 10 |
| Athena V2 GPTQ | 4K / 7.3 GB | 6 | 4 |
| OpenOrca Stx GPTQ | 4K / 7.3 GB | 19 | 1 |
| ...T Lora Assamble Marcoroni GPTQ | 4K / 7.3 GB | 11 | 1 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐