SauerkrautLM Mixtral 8x7B by VAGOsolutions

 ยป  All LLMs  ยป  VAGOsolutions  ยป  SauerkrautLM Mixtral 8x7B   URL Share it on

  Augmentation   Autotrain compatible   Chatml Dataset:argilla/distilabel-mat...   Dataset:open-orca/slimorca   De   Dpo   En   Endpoints compatible   Es   Finetuned   Fr   German   It   Mistral   Mixtral   Moe   Region:us   Safetensors   Sft   Sharded   Tensorflow

SauerkrautLM Mixtral 8x7B Benchmarks

SauerkrautLM Mixtral 8x7B (VAGOsolutions/SauerkrautLM-Mixtral-8x7B)
๐ŸŒŸ Advertise your project ๐Ÿš€

SauerkrautLM Mixtral 8x7B Parameters and Internals

Model Type 
Mixture of Experts (MoE)
Additional Notes 
Evaluated with lm-evaluation-harness v0.3.0, concerned with data contamination tests showing results below 0.1%. Licensing explains that models may be used for commercial purposes.
Supported Languages 
en (English), de (German), fr (French), it (Italian), es (Spanish)
Training Details 
Data Sources:
Open-Orca/SlimOrca, argilla/distilabel-math-preference-dpo, SauerkrautLM-DPO dataset, Sauerkraut-7b-HerO, HuggingFaceH4/ultrafeedback_binarized
Methodology:
SFT and DPO alignment with German data augmentation
LLM NameSauerkrautLM Mixtral 8x7B
Repository ๐Ÿค—https://huggingface.co/VAGOsolutions/SauerkrautLM-Mixtral-8x7B 
Model Size46.7b
Required VRAM93.6 GB
Updated2025-09-23
MaintainerVAGOsolutions
Model Typemixtral
Model Files  4.9 GB: 1-of-19   5.0 GB: 2-of-19   5.0 GB: 3-of-19   4.9 GB: 4-of-19   5.0 GB: 5-of-19   5.0 GB: 6-of-19   4.9 GB: 7-of-19   5.0 GB: 8-of-19   5.0 GB: 9-of-19   4.9 GB: 10-of-19   5.0 GB: 11-of-19   5.0 GB: 12-of-19   5.0 GB: 13-of-19   4.9 GB: 14-of-19   5.0 GB: 15-of-19   5.0 GB: 16-of-19   4.9 GB: 17-of-19   5.0 GB: 18-of-19   4.2 GB: 19-of-19
Supported Languagesen de fr it es
Model ArchitectureMixtralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32002
Torch Data Typebfloat16

Quantized Models of the SauerkrautLM Mixtral 8x7B

Model
Likes
Downloads
VRAM
SauerkrautLM Mixtral 8x7B AWQ1627 GB
SauerkrautLM Mixtral 8x7B GGUF924615 GB
SauerkrautLM Mixtral 8x7B AWQ3924 GB
SauerkrautLM Mixtral 8x7B GPTQ21023 GB

Best Alternatives to SauerkrautLM Mixtral 8x7B

Best Alternatives
Context / RAM
Downloads
Likes
Mixtral 8x7B Instruct V0.132K / 93.6 GB4811124637
Nous Hermes 2 Mixtral 8x7B DPO32K / 93.6 GB14686451
Mixtral 8x7B V0.132K / 93.6 GB617351788
Sensualize Mixtral Bf1632K / 93.6 GB00
Skadi Mixtral V132K / 93.5 GB00
Franziska Mixtral V132K / 93.5 GB00
Typhon Mixtral V132K / 93.4 GB00
GritLM 8x7B KTO32K / 93.6 GB82893
Smaug Mixtral V0.132K / 187.7 GB854812
NatureLM 8x7B32K / 0.3 GB7218
Note: green Score (e.g. "73.2") means that the model is better than VAGOsolutions/SauerkrautLM-Mixtral-8x7B.

Rank the SauerkrautLM Mixtral 8x7B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51611 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124