Mistral 7B MoEified 8x is an open-source language model by kalomaze. Features: 7B LLM, VRAM: 14.6 GB, Context: 32K, License: Apache-2.0, Quantized, LLM Explorer Score: 0.15.
Adaptive computation for efficient token prediction
Additional Notes
The method modifies a dense language model by dividing its MLP layers into "experts" and initializing router layers so that expert activation starts out unbiased, with every expert equally likely to be selected.
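A minimal sketch of the unbiased-router idea, assuming a PyTorch linear router over Mistral 7B's 4096-dimensional hidden states (the exact initialization used for this model is not documented here): zero-initialized router weights make the softmax over expert logits uniform, so no expert is preferred before any training occurs.

```python
import torch
import torch.nn as nn

num_experts = 8
hidden_size = 4096  # Mistral 7B hidden size

# Router: maps a token's hidden state to one logit per expert.
router = nn.Linear(hidden_size, num_experts, bias=False)
nn.init.zeros_(router.weight)  # all logits start at 0

x = torch.randn(1, hidden_size)            # one token's hidden state
probs = torch.softmax(router(x), dim=-1)   # uniform: each expert gets 1/8
```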
Training Details
Methodology:
Division of MLP layers into experts, with router layers initialized for equal expert usage.
Model Architecture:
Individual MLP layers are sliced into multiple experts, with router layers initialized to ensure equal expert usage; see the sketch below.
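One plausible way to slice a dense MLP into experts, sketched below under assumptions rather than taken from kalomaze's actual code: Mistral's SwiGLU MLP (gate_proj/up_proj project hidden to intermediate, down_proj projects back) is sharded along the intermediate dimension, and each shard becomes one expert. The helper name `slice_mlp_into_experts` is hypothetical.

```python
import torch
import torch.nn as nn

def slice_mlp_into_experts(gate_proj, up_proj, down_proj, num_experts=8):
    """Shard a dense SwiGLU MLP along its intermediate dimension so each
    shard becomes one expert. A hypothetical sketch, not the model's
    exact recipe."""
    inter, hidden = gate_proj.weight.shape  # e.g. (14336, 4096) for Mistral 7B
    shard = inter // num_experts
    experts = nn.ModuleList()
    for i in range(num_experts):
        lo, hi = i * shard, (i + 1) * shard
        g = nn.Linear(hidden, shard, bias=False)
        u = nn.Linear(hidden, shard, bias=False)
        d = nn.Linear(shard, hidden, bias=False)
        g.weight.data = gate_proj.weight[lo:hi].clone()     # rows of gate_proj
        u.weight.data = up_proj.weight[lo:hi].clone()       # rows of up_proj
        d.weight.data = down_proj.weight[:, lo:hi].clone()  # cols of down_proj
        experts.append(nn.ModuleDict({"gate": g, "up": u, "down": d}))
    return experts
```

Because SwiGLU acts elementwise along the intermediate dimension, summing the outputs of all eight experts reproduces the original dense MLP exactly; sparsity (and the efficiency gain) comes only once the router activates a subset of experts per token.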
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation