Mixtral 8x7B Instruct V0.1 DPO by cloudyu


Mixtral 8x7B Instruct V0.1 DPO is an open-source language model by cloudyu. Features: 46.7B parameters, VRAM: 93.6 GB, Context: 32K, License: apache-2.0, MoE, Instruction-Based, HF Score: 73.4, LLM Explorer Score: 0.13, ARC: 69.8, HellaSwag: 87.8, MMLU: 71.1, TruthfulQA: 69.2, WinoGrande: 81.4, GSM8K: 61.4.

Tags: Conversational, De, En, Endpoints compatible, Es, Fr, Instruct, It, Mixtral, Model-index, MoE, Region:us, Safetensors, Sharded, Tensorflow
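Because the repository ships standard sharded safetensors, it should load with the usual Hugging Face transformers APIs. The following is a minimal, untested loading sketch: it assumes transformers plus accelerate (for device_map="auto") and enough combined GPU/CPU memory for the 93.6 GB of bfloat16 weights listed below; the prompt and generation settings are purely illustrative.

```python
# Minimal loading sketch (assumes transformers + accelerate are installed
# and that roughly 94 GB of combined device memory is available).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # matches the card's bfloat16 weights
    device_map="auto",           # spreads the 19 shards across available devices
)

# Mixtral instruct models expect the [INST] ... [/INST] wrapper.
prompt = "[INST] Explain mixture-of-experts routing in two sentences. [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```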

Mixtral 8x7B Instruct V0.1 DPO Benchmarks

Mixtral 8x7B Instruct V0.1 DPO (cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO)

Mixtral 8x7B Instruct V0.1 DPO Parameters and Internals

Model Type: text-generation
Additional Notes: Benchmark metrics improved after 100 steps of Truthful DPO training.
Supported Languages: fr (full), it (full), de (full), es (full), en (full)
Training Methodology: DPO training (a hedged example run is sketched below)
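The card says only that DPO was used and that metrics improved after 100 steps; the maintainer's exact recipe is not published here. Below is a hypothetical sketch of a comparable run using Hugging Face's trl library. The base checkpoint, the preference dataset (jondurbin/truthy-dpo-v0.1, a plausible stand-in for "Truthful DPO" data), and every hyperparameter are assumptions, not the actual training configuration.

```python
# Hypothetical DPO sketch with trl; dataset and hyperparameters are
# assumptions, not the maintainer's published recipe.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

base = "mistralai/Mixtral-8x7B-Instruct-v0.1"  # assumed starting checkpoint
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# Any preference dataset with prompt/chosen/rejected columns works here.
train_ds = load_dataset("jondurbin/truthy-dpo-v0.1", split="train")

args = DPOConfig(
    output_dir="mixtral-8x7b-truthful-dpo",
    max_steps=100,                   # the card reports gains after 100 steps
    beta=0.1,                        # common DPO regularization strength
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
)

trainer = DPOTrainer(
    model=model,
    args=args,
    train_dataset=train_ds,
    processing_class=tokenizer,      # older trl releases take tokenizer= instead
)
trainer.train()
```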
LLM Name: Mixtral 8x7B Instruct V0.1 DPO
Repository: https://huggingface.co/cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO
Model Size: 46.7B
Required VRAM: 93.6 GB
Updated: 2026-04-02
Maintainer: cloudyu
Model Type: mixtral
Instruction-Based: Yes
Model Files: 19 safetensors shards: 4.9 GB (1-of-19), 5.0 GB (2-of-19), 5.0 GB (3-of-19), 4.9 GB (4-of-19), 5.0 GB (5-of-19), 5.0 GB (6-of-19), 4.9 GB (7-of-19), 5.0 GB (8-of-19), 5.0 GB (9-of-19), 4.9 GB (10-of-19), 5.0 GB (11-of-19), 5.0 GB (12-of-19), 5.0 GB (13-of-19), 4.9 GB (14-of-19), 5.0 GB (15-of-19), 5.0 GB (16-of-19), 4.9 GB (17-of-19), 5.0 GB (18-of-19), 4.2 GB (19-of-19)
Supported Languages: fr, it, de, es, en
Model Architecture: MixtralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.37.0
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 32000
Torch Data Type: bfloat16
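The architecture, context length, vocabulary size, tokenizer class, and padding token above can be sanity-checked straight from the repository. Here is a short sketch using the standard transformers config and tokenizer APIs; the inline comments restate the table's values, and the last call assumes the repo bundles Mixtral's [INST]-style chat template.

```python
# Sanity-check sketch for the table above; all attribute names are the
# standard transformers config/tokenizer fields.
from transformers import AutoConfig, AutoTokenizer

repo_id = "cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO"
config = AutoConfig.from_pretrained(repo_id)
tokenizer = AutoTokenizer.from_pretrained(repo_id)

print(config.model_type)               # "mixtral"
print(config.max_position_embeddings)  # 32768 (the 32K context length)
print(config.vocab_size)               # 32000
print(type(tokenizer).__name__)        # LlamaTokenizer / LlamaTokenizerFast
print(tokenizer.pad_token)             # "</s>"

# Formatting a chat turn, assuming the repo ships Mixtral's chat template:
messages = [{"role": "user", "content": "Hello!"}]
print(tokenizer.apply_chat_template(messages, tokenize=False))
# expected shape: "<s>[INST] Hello! [/INST]" (exact text depends on the template)
```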

Best Alternatives to Mixtral 8x7B Instruct V0.1 DPO

Best Alternatives | Context / RAM | Downloads | Likes
Mixtral 8x7B Instruct V0.1 | 32K / 93.6 GB | 489599 | 4657
...xtral 8x7B Yes Instruct LimaRP | 32K / 93.5 GB | 6 | 1
Mixtral 8x7B Instruct V0.1 FP8 | 32K / 47.1 GB | 3502 | 0
Merge Mixtral Prometheus 8x7B | 32K / 91.9 GB | 36 | 2
Notux 8x7b V1 | 32K / 93.6 GB | 38 | 164
...rkrautLM Mixtral 8x7B Instruct | 32K / 93.6 GB | 808 | 22
Dolphin 2.5 Mixtral 8x7b | 32K / 93.6 GB | 4125 | 1235
BagelMIsteryTour V2 8x7B | 32K / 93.5 GB | 113 | 17
Mixtral 8x7B Instruct V0.1 FP8 | 32K / 47.1 GB | 374 | 0
Sage Ft Mixtral 8x7b | 32K / 90 GB | 13 | 24
Note: a green score (e.g., 73.2) on the source page marks a model that outperforms cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO.

Original data from HuggingFace, OpenCompass and various public git repos.