| LLM Name | TomGrc FusionNet 34Bx2 MoE V0.1 DPO F16 5.0bpw H6 EXL2 |
| Repository | https://huggingface.co/LoneStriker/TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16-5.0bpw-h6-exl2 |
| Required VRAM | 38.8 GB |
| Updated | 2025-09-23 |
| Maintainer | LoneStriker |
| Model Type | mixtral |
| Model Files | |
| Quantization Type | fp16, exl2 |
| Model Architecture | MixtralForCausalLM |
| License | other |
| Context Length | 200000 |
| Model Max Length | 200000 |
| Transformers Version | 4.37.2 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | `<s>` |
| Vocabulary Size | 64000 |
| Torch Data Type | bfloat16 |
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...oE V0.1 DPO F16 4.0bpw H6 EXL2 | 195K / 31.3 GB | 7 | 0 |
| ...2 Mixtral 8x22b 6.0bpw H8 EXL2 | 64K / 105.8 GB | 4 | 1 |
| WizardLM 2 8x22 EXL2 4.0bpw | 64K / 70.9 GB | 0 | 1 |
| ...M 2 8x22B Beige 5.0bpw H6 EXL2 | 64K / 88.5 GB | 11 | 0 |
| ...M 2 8x22B Beige 2.4bpw H6 EXL2 | 64K / 42.7 GB | 6 | 0 |
| ...M 2 8x22B Beige 3.0bpw H6 EXL2 | 64K / 53.2 GB | 6 | 0 |
| ...M 2 8x22B Beige 4.0bpw H6 EXL2 | 64K / 70.8 GB | 5 | 0 |
| ...rdLM 2 8x22B Beige EXL2 5.0bpw | 64K / 88.4 GB | 5 | 0 |
| ...B Instruct V0.1 8.0bpw H8 EXL2 | 64K / 120.2 GB | 10 | 1 |
| ...2 Mixtral 8x22b 8.0bpw H8 EXL2 | 64K / 125.1 GB | 1 | 2 |
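
The listed sizes can be sanity-checked against the bits-per-weight (bpw) figure in each model name. The sketch below is a rough back-of-the-envelope check, not the site's or the quantizer's actual method. It assumes that the listed size is dominated by the quantized weights, that "GB" means decimal gigabytes, and that every weight is stored at the nominal bpw. The function names are hypothetical, and the only inputs are the 5.0bpw (38.8 GB) and 4.0bpw (31.3 GB) figures from the tables above.

```python
def implied_params_billions(size_gb: float, bits_per_weight: float) -> float:
    """Parameter count (in billions) implied by a quantized size.

    Assumption: the whole size is weight data at the nominal bpw,
    and 1 GB = 1e9 bytes. Real files also carry some overhead.
    """
    total_bits = size_gb * 1e9 * 8
    return total_bits / bits_per_weight / 1e9


def estimated_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB for a given parameter count and bpw."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9


# Figures taken from the listing: 5.0bpw -> 38.8 GB, 4.0bpw -> 31.3 GB.
from_5bpw = implied_params_billions(38.8, 5.0)
from_4bpw = implied_params_billions(31.3, 4.0)

print(f"implied parameters from 5.0bpw entry: {from_5bpw:.1f}B")
print(f"implied parameters from 4.0bpw entry: {from_4bpw:.1f}B")
```

Both entries imply a parameter count in the low 60-billion range. Under the stated assumptions, that suggests the two listed sizes are mutually consistent. The small gap between the two estimates is expected, since the listed figures may include more than the raw weights.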