Zephyr 7B Beta Marlin is an open-source language model maintained by RedHatAI. Key facts: 7B parameters, GPTQ-quantized, 4.1 GB required VRAM, 32K context. Scores: HF Score 59.1, LLM Explorer Score 0.25, ELO 1053, ARC 62, HellaSwag 84.5, MMLU 61.1, TruthfulQA 57.4, WinoGrande 78.1, GSM8K 11.4.
| LLM Name | Zephyr 7B Beta Marlin |
| Repository 🤗 | https://huggingface.co/RedHatAI/zephyr-7b-beta-marlin |
| Base Model(s) | |
| Model Size | 7b |
| Required VRAM | 4.1 GB |
| Updated | 2025-11-07 |
| Maintainer | RedHatAI |
| Model Type | mistral |
| Model Files | |
| GPTQ Quantization | Yes |
| Quantization Type | gptq |
| Model Architecture | MistralForCausalLM |
| Context Length | 32768 |
| Model Max Length | 32768 |
| Transformers Version | 4.37.2 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | </s> |
| Vocabulary Size | 32000 |
| Torch Data Type | float16 |
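The 4.1 GB VRAM figure above is roughly what weight-only arithmetic predicts for a 4-bit model of this size. The sketch below is a back-of-the-envelope estimate, not the site's method: the parameter count of about 7.24 billion and the 4-bit width are assumptions (the listing only says "7b" and "gptq"), and the helper name `estimate_weight_gb` is hypothetical. It ignores quantization scales, any layers kept in float16, activations, and the KV cache, so real usage will be higher.

```python
def estimate_weight_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes).

    Counts only the quantized weights themselves; scales, zero points,
    unquantized layers, activations, and KV cache are not included.
    """
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 1e9


# Assumed parameter count for a Mistral-7B-class model (not stated in the listing).
ASSUMED_PARAMS = 7.24e9

for bits in (4, 8, 16):
    size_gb = estimate_weight_gb(ASSUMED_PARAMS, bits)
    print(f"{bits:>2}-bit weights: about {size_gb:.2f} GB")
```

Under these assumptions the 4-bit estimate lands a little below the listed 4.1 GB, with the gap plausibly covered by quantization metadata and layers stored at higher precision.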
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...enHermes 2.5 Mistral 7B Marlin | 32K / 4.1 GB | 584 | 2 |
| Mistral 7B Instruct V0.3 GPTQ | 32K / 4.2 GB | 62824 | 1 |
| Mistral 7B Instruct V0.2 GPTQ | 32K / 4.2 GB | 47360 | 54 |
| ...ral 7B Instruct V0.3 GPTQ 4bit | 32K / 4.2 GB | 2946 | 18 |
| ...ral 7B Instruct V0.3 GPTQ 4bit | 32K / 4.2 GB | 2095 | 23 |
| ...istral 7B Pruned50 GPTQ Marlin | 32K / 4 GB | 6 | 0 |
| Mistral 7B Unsloth Gptq 8bit | 32K / 7.7 GB | 7 | 0 |
| ...phyr 7B Beta Assistant V1 Gptq | 32K / 4.2 GB | 1 | 1 |
| ...l Neural Chat 7B V3.8 Bit Gptq | 32K / 7.7 GB | 7 | 0 |
| ...lai Mistral 7B V0.1 8 Bit Gptq | 32K / 7.7 GB | 8 | 0 |