| LLM Name | Zephyr 7B Beta Marlin |
| Repository ๐ค | https://huggingface.co/RedHatAI/zephyr-7b-beta-marlin |
| Base Model(s) | |
| Model Size | 7b |
| Required VRAM | 4.1 GB |
| Updated | 2025-10-26 |
| Maintainer | RedHatAI |
| Model Type | mistral |
| Model Files | |
| GPTQ Quantization | Yes |
| Quantization Type | gptq |
| Model Architecture | MistralForCausalLM |
| Context Length | 32768 |
| Model Max Length | 32768 |
| Transformers Version | 4.37.2 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | </s> |
| Vocabulary Size | 32000 |
| Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| ...enHermes 2.5 Mistral 7B Marlin | 32K / 4.1 GB | 584 | 2 |
| ...ral 7B Instruct V0.3 GPTQ 4bit | 32K / 4.2 GB | 10039 | 22 |
| ...ral 7B Instruct V0.3 GPTQ 4bit | 32K / 4.2 GB | 2946 | 18 |
| ...istral 7B Pruned50 GPTQ Marlin | 32K / 4 GB | 6 | 0 |
| Mistral 7B Unsloth Gptq 8bit | 32K / 7.7 GB | 7 | 0 |
| Mistral 7B Instruct V0.2 GPTQ | 32K / 4.2 GB | 5998 | 54 |
| Mistral 7B Instruct V0.3 GPTQ | 32K / 4.2 GB | 223 | 1 |
| ...phyr 7B Beta Assistant V1 Gptq | 32K / 4.2 GB | 6 | 1 |
| ...l Neural Chat 7B V3.8 Bit Gptq | 32K / 7.7 GB | 7 | 0 |
| ...lai Mistral 7B V0.1 8 Bit Gptq | 32K / 7.7 GB | 8 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐