| Model Type | 
 | |
| Additional Notes | 
 | 
| LLM Name | Gemma 2 9B Instruct 4Bit GPTQ | 
| Repository ๐ค | https://huggingface.co/Granther/Gemma-2-9B-Instruct-4Bit-GPTQ | 
| Model Name | Gemma-2-9B-Instruct-4Bit-GPTQ | 
| Base Model(s) | |
| Model Size | 9b | 
| Required VRAM | 6.2 GB | 
| Updated | 2025-09-23 | 
| Maintainer | Granther | 
| Model Type | gemma2 | 
| Instruction-Based | Yes | 
| Model Files | |
| GPTQ Quantization | Yes | 
| Quantization Type | gptq|4bit | 
| Model Architecture | Gemma2ForCausalLM | 
| License | gemma | 
| Context Length | 8192 | 
| Model Max Length | 8192 | 
| Transformers Version | 4.43.0.dev0 | 
| Tokenizer Class | GemmaTokenizer | 
| Padding Token | <pad> | 
| Vocabulary Size | 256000 | 
| Torch Data Type | float16 | 
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| ...2 9B Cpt Sahabatai V1 Instruct | 8K / 18.6 GB | 2581 | 45 | 
| SILMA 9B Instruct V1.0 | 8K / 18.6 GB | 12935 | 79 | 
| Gemma Evo 10B | 8K / 20.4 GB | 39 | 5 | 
| Gigantes V2 Gemma2 9B It | 8K / 18.6 GB | 6 | 1 | 
| Gigantes V3 Gemma2 9B It | 8K / 18.6 GB | 5 | 0 | 
| Gigantes V1 Gemma2 9B It | 8K / 18.6 GB | 6 | 2 | 
| Magnum V4 9B | 8K / 18.6 GB | 1085 | 17 | 
| Turkish Gemma 9B V0.1 | 8K / 18.6 GB | 2924 | 32 | 
| Odin 9B | 8K / 18.6 GB | 390 | 5 | 
| ...B IT Simpo Infinity Preference | 8K / 18.6 GB | 1126 | 17 | 
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐