| Model Type | 
 | |
| Additional Notes | 
 | |
| Supported Languages | 
 | 
| LLM Name | Nous Hermes 2 Mixtral 8x7B SFT GGUF | 
| Repository ๐ค | https://huggingface.co/second-state/Nous-Hermes-2-Mixtral-8x7B-SFT-GGUF | 
| Model Name | Nous Hermes 2 Mixtral 8X7B SFT | 
| Model Creator | NousResearch | 
| Base Model(s) | |
| Required VRAM | 17.3 GB | 
| Updated | 2025-10-10 | 
| Maintainer | second-state | 
| Model Type | mixtral | 
| Model Files | |
| Supported Languages | en | 
| GGUF Quantization | Yes | 
| Quantization Type | gguf|q2|q4_k|q5_k | 
| Model Architecture | MixtralForCausalLM | 
| License | apache-2.0 | 
| Context Length | 32768 | 
| Model Max Length | 32768 | 
| Transformers Version | 4.36.0.dev0 | 
| Vocabulary Size | 32002 | 
| Torch Data Type | bfloat16 | 
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| WizardLM2 2bit | 64K / 4.8 GB | 155 | 0 | 
| Dolphin 2.7 Mixtral 8x7b GGUF | 32K / 15.6 GB | 1802 | 5 | 
| ...Hermes 2 Mixtral 8x7B DPO GGUF | 32K / 17.3 GB | 1229 | 2 | 
| NebulaNet V2 4x7B MoE | 32K / 8.8 GB | 70 | 4 | 
| ...ixtral 8x7B Instruct V0.1 GGUF | 32K / 17.3 GB | 1378 | 5 | 
| Cerebrum 1.0 8x7b GGUF | 32K / 17.3 GB | 13 | 1 | 
| ...oE V0.1 DPO F16 4.0bpw H6 EXL2 | 195K / 31.3 GB | 7 | 0 | 
| ...oE V0.1 DPO F16 5.0bpw H6 EXL2 | 195K / 38.8 GB | 7 | 0 | 
| ...2 Mixtral 8x22b 6.0bpw H8 EXL2 | 64K / 105.8 GB | 5 | 1 | 
| WizardLM 2 8x22 EXL2 4.0bpw | 64K / 70.9 GB | 6 | 1 | 
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐