SauerkrautLM Mixtral 8x7B by VAGOsolutions

 »  All LLMs  »  VAGOsolutions  »  SauerkrautLM Mixtral 8x7B   URL Share it on

SauerkrautLM Mixtral 8x7B is an open-source language model by VAGOsolutions. Features: 46.7b LLM, VRAM: 93.6GB, Context: 32K, License: apache-2.0, MoE, LLM Explorer Score: 0.12, Arc: 68.9, HellaSwag: 86, MMLU: 66.7, GSM8K: 47.5.

  Augmentation   Chatml Dataset:argilla/distilabel-mat...   Dataset:open-orca/slimorca   De   Dpo   En   Endpoints compatible   Es   Finetuned   Fr   German   It   Mistral   Mixtral   Moe   Region:us   Safetensors   Sft   Sharded   Tensorflow

SauerkrautLM Mixtral 8x7B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

SauerkrautLM Mixtral 8x7B Parameters and Internals

Model Type 
Mixture of Experts (MoE)
Additional Notes 
Evaluated with lm-evaluation-harness v0.3.0, concerned with data contamination tests showing results below 0.1%. Licensing explains that models may be used for commercial purposes.
Supported Languages 
en (English), de (German), fr (French), it (Italian), es (Spanish)
Training Details 
Data Sources:
Open-Orca/SlimOrca, argilla/distilabel-math-preference-dpo, SauerkrautLM-DPO dataset, Sauerkraut-7b-HerO, HuggingFaceH4/ultrafeedback_binarized
Methodology:
SFT and DPO alignment with German data augmentation
LLM NameSauerkrautLM Mixtral 8x7B
Repository 🤗https://huggingface.co/VAGOsolutions/SauerkrautLM-Mixtral-8x7B 
Model Size46.7b
Required VRAM93.6 GB
Updated2026-05-21
MaintainerVAGOsolutions
Model Typemixtral
Model Files  4.9 GB: 1-of-19   5.0 GB: 2-of-19   5.0 GB: 3-of-19   4.9 GB: 4-of-19   5.0 GB: 5-of-19   5.0 GB: 6-of-19   4.9 GB: 7-of-19   5.0 GB: 8-of-19   5.0 GB: 9-of-19   4.9 GB: 10-of-19   5.0 GB: 11-of-19   5.0 GB: 12-of-19   5.0 GB: 13-of-19   4.9 GB: 14-of-19   5.0 GB: 15-of-19   5.0 GB: 16-of-19   4.9 GB: 17-of-19   5.0 GB: 18-of-19   4.2 GB: 19-of-19
Supported Languagesen de fr it es
Model ArchitectureMixtralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32002
Torch Data Typebfloat16

Quantized Models of the SauerkrautLM Mixtral 8x7B

Model
Likes
Downloads
VRAM
SauerkrautLM Mixtral 8x7B AWQ1727 GB
SauerkrautLM Mixtral 8x7B GGUF919815 GB
SauerkrautLM Mixtral 8x7B AWQ31524 GB
SauerkrautLM Mixtral 8x7B GPTQ21523 GB

Best Alternatives to SauerkrautLM Mixtral 8x7B

Best Alternatives
Context / RAM
Downloads
Likes
Mixtral 8x7B Instruct V0.132K / 93.6 GB8931974683
Nous Hermes 2 Mixtral 8x7B DPO32K / 93.6 GB9047453
Mixtral 8x7B V0.132K / 93.6 GB1120781808
Sensualize Mixtral Bf1632K / 93.6 GB00
Skadi Mixtral V132K / 93.5 GB00
Franziska Mixtral V132K / 93.5 GB00
Typhon Mixtral V132K / 93.4 GB00
Mixtral 8x7B Instruct V0.1 FP832K / 47.1 GB1120110
GritLM 8x7B KTO32K / 93.6 GB79523
Smaug Mixtral V0.132K / 187.7 GB854812
Note: green Score (e.g. "73.2") means that the model is better than VAGOsolutions/SauerkrautLM-Mixtral-8x7B.

Rank the SauerkrautLM Mixtral 8x7B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53999 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a