Dzakwan MoE 4x7b Beta is an open-source Mixture-of-Experts (MoE) language model by dzakwan. Key specifications: 24.2B parameters, ~48.4 GB VRAM, 32K context window, Apache-2.0 license. Benchmark scores: HF Score 75.8, LLM Explorer Score 0.24, ARC 72.1, HellaSwag 88.9, MMLU 64.9, TruthfulQA 74.5, WinoGrande 83.5, GSM8K 70.8.
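For reference, below is a minimal loading sketch using the Hugging Face transformers library. The repository ID `dzakwan/dzakwan-MoE-4x7b-Beta` is assumed from the model name and may differ; given roughly 48 GB of weights, multi-GPU or offloaded loading (`device_map="auto"`) is typically needed.

```python
# Minimal sketch: loading the model with Hugging Face transformers.
# The repo ID below is assumed from the model name and may not be exact.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "dzakwan/dzakwan-MoE-4x7b-Beta"  # assumed repository ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~48 GB of weights; requires large or multiple GPUs
    device_map="auto",           # spread layers across available devices / offload
)

prompt = "Explain the mixture-of-experts architecture in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```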
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| Beyonder 4x7B V3 | 32K / 48.3 GB | 7856 | 60 |
| Calme 4x7B MoE V0.2 | 32K / 48.3 GB | 8127 | 2 |
| Mera Mix 4x7B | 32K / 48.3 GB | 8157 | 19 |
| Calme 4x7B MoE V0.1 | 32K / 48.3 GB | 8003 | 2 |
| MixtureofMerges MoE 4x7b V5 | 32K / 48.3 GB | 8117 | 1 |
| MixtureofMerges MoE 4x7b V4 | 32K / 48.3 GB | 9068 | 4 |
| CognitiveFusion2 4x7B BF16 | 32K / 48.3 GB | 8185 | 3 |
| Proto Athena 4x7B | 32K / 48.4 GB | 5 | 0 |
| Proto Athena V0.2 4x7B | 32K / 48.4 GB | 6 | 0 |
| NeuralStar FusionWriter 4x7b | 32K / 48.3 GB | 24 | 9 |