Umbra V3 MoE 4x11b 2ex by SteelStorage


Tags: autotrain-compatible, endpoints-compatible, conversational, frankenmoe, merge, mergekit, mixtral, moe, safetensors, sharded, tensorflow, region:us
Base models (merge): beberik/Nyxene-v3-11B, decapoda-research/Antares-11b-v2, Himitsui/Kaiju-11B, Sao10K/Fimbulvetr-11B-v2
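The mergekit, frankenmoe, and moe tags indicate a Mixtral-style mixture-of-experts assembled from the four 11B base models with mergekit's MoE tooling, and the "2ex" in the name suggests two experts are routed per token. The actual recipe is not published on this page; the Python sketch below only illustrates what a mergekit-moe config for such a build could look like, and the base_model choice, gate_mode, experts_per_token, and positive_prompts are all illustrative assumptions rather than the author's settings.

# Hypothetical sketch: writes an illustrative mergekit-moe recipe for a
# 4x11b frankenMoE. None of the values below are taken from the model card.
from pathlib import Path

config_yaml = """\
base_model: Sao10K/Fimbulvetr-11B-v2   # assumed base model (not stated on this page)
gate_mode: hidden                      # assumed router initialization mode
dtype: bfloat16                        # matches the card's Torch Data Type
experts_per_token: 2                   # assumed from the "2ex" suffix
experts:
  - source_model: Himitsui/Kaiju-11B
    positive_prompts: ["roleplay", "storytelling"]
  - source_model: Sao10K/Fimbulvetr-11B-v2
    positive_prompts: ["creative writing"]
  - source_model: decapoda-research/Antares-11b-v2
    positive_prompts: ["reasoning", "general assistance"]
  - source_model: beberik/Nyxene-v3-11B
    positive_prompts: ["instruction following"]
"""

Path("umbra-moe-config.yaml").write_text(config_yaml)
# Assumed CLI form for building the merged checkpoint:
#   mergekit-moe umbra-moe-config.yaml ./Umbra-v3-MoE-4x11b-2ex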

Umbra V3 MoE 4x11b 2ex Benchmarks

Scores (nn.n%) show how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), and GPT-4 ("gpt4").
Umbra V3 MoE 4x11b 2ex (SteelStorage/Umbra-v3-MoE-4x11b-2ex)

Umbra V3 MoE 4x11b 2ex Parameters and Internals

LLM Name: Umbra V3 MoE 4x11b 2ex
Repository: https://huggingface.co/SteelStorage/Umbra-v3-MoE-4x11b-2ex
Base Model(s): Himitsui/Kaiju-11B, Sao10K/Fimbulvetr-11B-v2, decapoda-research/Antares-11b-v2, beberik/Nyxene-v3-11B
Model Size: 36.1b
Required VRAM: 72.3 GB
Updated: 2025-09-23
Maintainer: SteelStorage
Model Type: mixtral
Model Files: 9.9 GB (1-of-8), 10.0 GB (2-of-8), 10.0 GB (3-of-8), 10.0 GB (4-of-8), 10.0 GB (5-of-8), 10.0 GB (6-of-8), 10.0 GB (7-of-8), 2.4 GB (8-of-8)
Model Architecture: MixtralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.39.3
Tokenizer Class: LlamaTokenizer
Padding Token: <s>
Vocabulary Size: 32000
Torch Data Type: bfloat16
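Given the metadata above (MixtralForCausalLM architecture, bfloat16 weights sharded across eight safetensors files, 32768-token context, LlamaTokenizer), the checkpoint loads through the standard Hugging Face Transformers API. A minimal sketch, assuming transformers >= 4.39.3 and enough memory for the full bfloat16 weights (about 72 GB of VRAM, or CPU offload via device_map):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "SteelStorage/Umbra-v3-MoE-4x11b-2ex"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the card's Torch Data Type
    device_map="auto",           # shard across available GPUs / offload to CPU
)

prompt = "Write a short scene set in a moonlit forest."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0], skip_special_tokens=True))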

Best Alternatives to Umbra V3 MoE 4x11b 2ex

Best Alternatives             | Context / RAM  | Downloads | Likes
Umbra V3 MoE 4x11b 2ex        | 32K / 72.3 GB  | 286       | 4
PiVoT MoE                     | 32K / 72.3 GB  | 1790      | 8
Umbra V3 MoE 4x11b            | 32K / 72.3 GB  | 5         | 5
Umbra V2.1 MoE 4x10.7         | 32K / 72.3 GB  | 6         | 6
Mixolar 4x7b                  | 4K / 72.3 GB   | 9780      | 3
Smartsolmix 4x10.7B V1        | 4K / 72.3 GB   | 1858      | 0
Orca SOLAR 4x10.7B            | 4K / 72.3 GB   | 1738      | 0
MetaModel MoE                 | 4K / 72.3 GB   | 1914      | 0
SOLARC MoE 10.7Bx4            | 4K / 144.7 GB  | 1917      | 9
Frankenstein MoE En 10.7Bx4   | 4K / 72.3 GB   | 1915      | 0



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124