| LLM Name | Llama 8B Gguf | 
| Repository 🤗 | https://huggingface.co/empower-dev-staging/llama-8b-gguf | 
| Model Size | 8B | 
| Required VRAM | 4.9 GB | 
| Updated | 2025-09-28 | 
| Maintainer | empower-dev-staging | 
| Model Type | llama | 
| Instruction-Based | Yes | 
| Model Files | |
| GGML Quantization | Yes | 
| GGUF Quantization | Yes | 
| Quantization Type | gguf, ggml, q4, q4_k | 
| Model Architecture | LlamaForCausalLM | 
| Context Length | 8192 | 
| Model Max Length | 8192 | 
| Transformers Version | 4.40.0.dev0 | 
| Tokenizer Class | PreTrainedTokenizerFast | 
| Padding Token | <|end_of_text|> | 
| Vocabulary Size | 128256 | 
| Torch Data Type | bfloat16 | 
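
As a rough sanity check on the Required VRAM figure above, weight memory can be estimated from the parameter count and the average bits stored per weight. The sketch below is illustrative only: the helper name `estimate_weight_size_gb` is hypothetical, the parameter count is rounded to 8 billion, and the 4.9 bits-per-weight value is an assumption chosen to match the listed figure, not a measured property of this repository's files. It also ignores the KV cache and runtime overhead.

```python
def estimate_weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    total_bits = n_params * bits_per_weight
    total_bytes = total_bits / 8
    return total_bytes / 1e9


# Assumed values: 8e9 parameters (rounded) and an average of about
# 4.9 bits per weight for a 4-bit K-quant (illustrative, not measured).
quantized_gb = estimate_weight_size_gb(8e9, 4.9)

# bfloat16 stores 16 bits per weight, for comparison with unquantized models.
bf16_gb = estimate_weight_size_gb(8e9, 16)

print(f"4-bit quantized estimate: {quantized_gb:.1f} GB")
print(f"bfloat16 estimate: {bf16_gb:.1f} GB")
```

Under these assumptions the bfloat16 estimate is about 16 GB, roughly in line with the unquantized 8B entries in the alternatives table, while the 4-bit estimate is close to the 4.9 GB listed for this model.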
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| ...del Llama 3.1 8B Instruct 4bit | 128K / 16.1 GB | 28 | 0 | 
| ...orean Llama3.1 Sft Rlhf DPO 8B | 128K / 16.1 GB | 1106 | 3 | 
| ...AI Korean Llama 3.1 Sft DPO 8B | 128K / 16.1 GB | 1174 | 7 | 
| ...Ko Llama3 Instruct DPO 8B Base | 8K / 16.1 GB | 457 | 0 | 
| Llama 3 8B Instruct Chinese | 8K / 16.1 GB | 1289 | 35 | 
| ...Llama 3 8B Instruct Fine Tuned | 8K / 18.8 GB | 13 | 0 | 
| Rag Tge Pl Llama 3 8B | 8K / 16.1 GB | 26 | 0 | 
| ...3 Empower Functions Small Gguf | 8K / 4.9 GB | 26 | 0 | 
| ...truct Gradient 1048K IMat GGUF | 1024K / 2 GB | 1523 | 6 | 
| ...B Instruct Gradient 1048K GGUF | 1024K / 3.2 GB | 1231 | 3 | 