OpenAssistant SFT 7 Llama 30B GPTQ is an open-source language model by TheBloke. Features: 30b LLM, VRAM: 16.9GB, Context: 2K, License: other, Quantized, HF Score: 59.3, LLM Explorer Score: 0.1, ARC: 60.6, HellaSwag: 82.2, MMLU: 57.9, TruthfulQA: 46.9, WinoGrande: 78.6, GSM8K: 29.8.
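The HF Score quoted above appears consistent with an unweighted mean of the six benchmark scores. The page does not state how the score is computed, so treat that as an assumption. A minimal sketch checking the arithmetic (the variable names are illustrative):

```python
# Benchmark scores as listed on this page.
scores = {
    "ARC": 60.6,
    "HellaSwag": 82.2,
    "MMLU": 57.9,
    "TruthfulQA": 46.9,
    "WinoGrande": 78.6,
    "GSM8K": 29.8,
}

# Assumption: HF Score is the unweighted mean of the six benchmarks.
hf_score = sum(scores.values()) / len(scores)

print(round(hf_score, 1))  # prints 59.3, matching the listed HF Score
```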
| LLM Name | OpenAssistant SFT 7 Llama 30B GPTQ |
| Repository 🤗 | https://huggingface.co/TheBloke/OpenAssistant-SFT-7-Llama-30B-GPTQ |
| Base Model(s) | |
| Model Size | 30b |
| Required VRAM | 16.9 GB |
| Updated | 2026-04-07 |
| Maintainer | TheBloke |
| Model Type | llama |
| Model Files | |
| GPTQ Quantization | Yes |
| Quantization Type | gptq |
| Model Architecture | LlamaForCausalLM |
| License | other |
| Context Length | 2048 |
| Model Max Length | 2048 |
| Transformers Version | 4.29.0.dev0 |
| Tokenizer Class | LlamaTokenizer |
| End of Sentence Token | </s> |
| Unk Token | </s> |
| Vocabulary Size | 32016 |
| Torch Data Type | float16 |
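
The 16.9 GB VRAM figure can be sanity-checked with a rough weight-size estimate. The sketch below assumes 4-bit weights and roughly 32.5 billion parameters for the LLaMA "30B" class. Neither number appears on this page, and the function name is hypothetical. It ignores quantization scales and zero points, activations, and the KV cache, so it gives a lower bound and not a prediction.

```python
def estimate_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumptions (not stated on this page): parameter count and bit width.
ASSUMED_PARAMS = 32.5e9
ASSUMED_BITS = 4

weights_gb = estimate_weight_gb(ASSUMED_PARAMS, ASSUMED_BITS)
print(f"{weights_gb:.2f} GB")  # weights only, excluding runtime overhead
```

Under these assumptions, the gap between the estimate and the listed figure would be taken up by per-group quantization metadata and any layers kept in float16, which this sketch does not model.
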
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| GPlatty 30B SuperHOT 8K GPTQ | 8K / 16.9 GB | 18 | 7 |
| ... 30B Supercot SuperHOT 8K GPTQ | 8K / 16.9 GB | 8 | 5 |
| Platypus 30B SuperHOT 8K GPTQ | 8K / 16.9 GB | 5 | 4 |
| Tulu 30B SuperHOT 8K GPTQ | 8K / 16.9 GB | 3 | 5 |
| Yayi2 30B Llama GPTQ | 4K / 17 GB | 7 | 2 |
| WizardLM 30B GPTQ | 2K / 16.9 GB | 395 | 18 |
| Llama 30B FINAL MODEL MINI | 2K / 19.4 GB | 0 | 1 |
| ...2 Llama 30B 7K Steps Gptq 2bit | 2K / 9.5 GB | 8 | 2 |
| WizardLM 30B V1.0 GPTQ | 2K / 16.9 GB | 3 | 1 |
| ...2 Llama 30B 7K Steps Gptq 4bit | 2K / 17.5 GB | 4 | 3 |