Frankenstein MoE En 10.7Bx4 by MoEMoEKKung


Tags: Autotrain compatible, Conversational, En, Endpoints compatible, Mixtral, MoE, Region: us, Safetensors, Sharded, Tensorflow

Frankenstein MoE En 10.7Bx4 Benchmarks

Frankenstein MoE En 10.7Bx4 (MoEMoEKKung/Frankenstein-MoE-en-10.7Bx4)

Frankenstein MoE En 10.7Bx4 Parameters and Internals

Model Type: MoE
Additional Notes: Evals in progress.
Supported Languages: en (proficient)
Training Details:
Data Sources: H6 train set; TruthfulQA data generated by GPT-4
Methodology: Samples drawn from the H6 train set were used to initialize the gate projection weights of the MoE layers (see the sketch below).
Model Architecture: MoE layers with gate projection weight initialization.
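The gate initialization described above resembles the "hidden-state" router initialization used by frankenMoE merging tools: representative prompts for each expert are run through a base model, and their averaged hidden states seed the rows of the router's gate projection. Below is a minimal, hypothetical sketch of that general idea; the base model id, prompt sets, and pooling choices are assumptions, not the author's exact procedure.

```python
# Illustrative sketch only: derive an MoE gate projection from averaged hidden
# states of per-expert prompt samples (the general "hidden-state" router-init
# idea used by frankenMoE merges). All names below are hypothetical.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "upstage/SOLAR-10.7B-Instruct-v1.0"  # assumed 10.7B base; not confirmed by the card
tok = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, device_map="auto"  # device_map needs accelerate
)

# Hypothetical: a few H6-style prompts representing each expert's domain.
expert_prompts = {
    0: ["Question: What is 12 * 7? Answer:"],
    1: ["Summarize the following passage in one sentence: ..."],
    2: ["Is the following statement true or false? ..."],
    3: ["Continue the story: Once upon a time ..."],
}

hidden_size = model.config.hidden_size
gate_weight = torch.zeros(len(expert_prompts), hidden_size)

with torch.no_grad():
    for expert_idx, prompts in expert_prompts.items():
        vecs = []
        for p in prompts:
            ids = tok(p, return_tensors="pt").to(model.device)
            out = model(**ids, output_hidden_states=True)
            # Mean-pool the final hidden layer over the sequence dimension.
            vecs.append(out.hidden_states[-1].float().mean(dim=1).squeeze(0).cpu())
        gate_weight[expert_idx] = torch.stack(vecs).mean(dim=0)

# `gate_weight` (num_experts x hidden_size) would then seed each MoE layer's
# gate projection (e.g. `block_sparse_moe.gate.weight` in a Mixtral-style block);
# in practice a separate gate is usually derived per layer from that layer's
# hidden states.
```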
LLM Name: Frankenstein MoE En 10.7Bx4
Repository: https://huggingface.co/MoEMoEKKung/Frankenstein-MoE-en-10.7Bx4
Model Size: 36.1b
Required VRAM: 72.3 GB
Updated: 2025-09-23
Maintainer: MoEMoEKKung
Model Type: mixtral
Model Files: 9.9 GB (1-of-8), 10.0 GB (2-of-8), 10.0 GB (3-of-8), 10.0 GB (4-of-8), 10.0 GB (5-of-8), 10.0 GB (6-of-8), 10.0 GB (7-of-8), 2.4 GB (8-of-8)
Supported Languages: en
Model Architecture: MixtralForCausalLM
License: cc-by-nc-sa-4.0
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.36.2
Tokenizer Class: LlamaTokenizer
Padding Token: <s>
Vocabulary Size: 32000
Torch Data Type: float16
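For orientation, here is a minimal Hugging Face transformers loading sketch consistent with the configuration above; the plain-text prompt is an assumption, since the card does not document a chat template. Note also that ~36.1B parameters in float16 occupy roughly 36.1e9 × 2 bytes ≈ 72 GB, which matches the listed 72.3 GB VRAM requirement.

```python
# Minimal loading sketch consistent with the card above (MixtralForCausalLM,
# float16 weights, 4096-token context, weights split across 8 safetensors shards).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "MoEMoEKKung/Frankenstein-MoE-en-10.7Bx4"
tokenizer = AutoTokenizer.from_pretrained(repo)  # LlamaTokenizer, 32000-token vocab
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.float16,  # matches the card's torch dtype
    device_map="auto",          # spreads the shards across available GPUs
)

prompt = "Explain what a mixture-of-experts layer does."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```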

Best Alternatives to Frankenstein MoE En 10.7Bx4

Best Alternatives           Context / RAM    Downloads   Likes
Umbra V3 MoE 4x11b 2ex      32K / 72.3 GB    286         4
PiVoT MoE                   32K / 72.3 GB    1790        8
Umbra V3 MoE 4x11b 2ex      32K / 72.3 GB    5           4
Umbra V3 MoE 4x11b          32K / 72.3 GB    5           5
Umbra V2.1 MoE 4x10.7       32K / 72.3 GB    6           6
Mixolar 4x7b                4K / 72.3 GB     9780        3
Smartsolmix 4x10.7B V1      4K / 72.3 GB     1858        0
Orca SOLAR 4x10.7B          4K / 72.3 GB     1738        0
MetaModel MoE               4K / 72.3 GB     1914        0
SOLARC MoE 10.7Bx4          4K / 144.7 GB    1917        9


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124