MoMo 72B Lora 1.8.6 DPO by moreh


Tags: Arxiv:2106.09685, Arxiv:2305.18290, Autotrain compatible, En, Endpoints compatible, Llama, Lora, Region:us, Safetensors, Sharded, Tensorflow

MoMo 72B Lora 1.8.6 DPO Benchmarks

MoMo 72B Lora 1.8.6 DPO (moreh/MoMo-72B-lora-1.8.6-DPO)

MoMo 72B Lora 1.8.6 DPO Parameters and Internals

Model Type: causal language model
Additional Notes: The model is optimized with adjusted hyperparameters for better performance. No benchmark test dataset was used during training. The trained weights are re-aligned for compatibility with the Llama architecture.
Supported Languages: en (proficient)
Training Details:
Data Sources: slimorca, Truthy, orca_dpo_pairs
Methodology: Training was conducted using Direct Preference Optimization (DPO) and Supervised Fine-Tuning (SFT) with LoRA; no weight merging was used (see the sketch below).
Hardware Used: AMD MI250
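
The training code itself is not published on this page. As a rough illustration of the methodology described above (LoRA adapters trained with DPO on preference pairs such as orca_dpo_pairs), here is a minimal sketch using Hugging Face's trl and peft libraries. The base model name, LoRA hyperparameters, and dataset handling are assumptions, not moreh's actual recipe, and the DPOTrainer API differs between trl versions.

```python
# Illustrative sketch only: the base model, LoRA hyperparameters, and dataset handling
# are assumptions for demonstration, not moreh's published training recipe.
# Requires: torch, transformers, datasets, peft, trl (DPOTrainer API varies by trl version).
import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

BASE = "some-org/llama-compatible-72b-base"  # placeholder; this page does not name the base model

tokenizer = AutoTokenizer.from_pretrained(BASE)
model = AutoModelForCausalLM.from_pretrained(BASE, torch_dtype=torch.bfloat16)

# Preference data: DPOTrainer expects "prompt", "chosen", and "rejected" columns.
dataset = load_dataset("Intel/orca_dpo_pairs", split="train")
dataset = dataset.rename_column("question", "prompt")

# LoRA: the frozen base weights stay untouched; only low-rank adapter matrices are trained.
peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")

args = DPOConfig(
    output_dir="momo-style-dpo-lora",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    learning_rate=5e-6,
    beta=0.1,  # DPO temperature controlling deviation from the reference policy
)

trainer = DPOTrainer(
    model=model,
    args=args,
    train_dataset=dataset,
    processing_class=tokenizer,  # older trl versions take `tokenizer=` instead
    peft_config=peft_config,
)
trainer.train()
```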
LLM Name: MoMo 72B Lora 1.8.6 DPO
Repository: https://huggingface.co/moreh/MoMo-72B-lora-1.8.6-DPO
Model Size: 72b
Required VRAM: 208.5 GB
Updated: 2025-10-03
Maintainer: moreh
Model Type: llama
Model Files  5.0 GB: 1-of-63   4.6 GB: 2-of-63   4.3 GB: 3-of-63   4.3 GB: 4-of-63   4.8 GB: 5-of-63   4.8 GB: 6-of-63   4.3 GB: 7-of-63   4.8 GB: 8-of-63   4.8 GB: 9-of-63   4.3 GB: 10-of-63   4.8 GB: 11-of-63   4.8 GB: 12-of-63   4.3 GB: 13-of-63   4.8 GB: 14-of-63   4.8 GB: 15-of-63   4.3 GB: 16-of-63   4.8 GB: 17-of-63   4.8 GB: 18-of-63   4.3 GB: 19-of-63   4.8 GB: 20-of-63   4.8 GB: 21-of-63   4.3 GB: 22-of-63   4.8 GB: 23-of-63   4.8 GB: 24-of-63   4.3 GB: 25-of-63   4.8 GB: 26-of-63   4.8 GB: 27-of-63   4.3 GB: 28-of-63   4.8 GB: 29-of-63   4.8 GB: 30-of-63   4.3 GB: 31-of-63   4.8 GB: 32-of-63   4.8 GB: 33-of-63   4.3 GB: 34-of-63   4.8 GB: 35-of-63   4.8 GB: 36-of-63   4.3 GB: 37-of-63   4.8 GB: 38-of-63   4.8 GB: 39-of-63   4.3 GB: 40-of-63   4.8 GB: 41-of-63   4.8 GB: 42-of-63   4.3 GB: 43-of-63   4.8 GB: 44-of-63   4.8 GB: 45-of-63
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: mit
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.36.0
Vocabulary Size: 152064
LoRA Model: Yes
Torch Data Type: float32
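
Given the specification above (LlamaForCausalLM, sharded safetensors, 32768-token context, float32 weights), the model loads with standard transformers tooling. The following is a minimal loading sketch; the half-precision dtype, device mapping, and plain-text prompt are assumptions, since this page does not document a chat template, and a 72B model still needs multiple GPUs or CPU offload even in float16.

```python
# Minimal inference sketch for moreh/MoMo-72B-lora-1.8.6-DPO (LlamaForCausalLM).
# Assumptions: float16 loading (the stored weights are float32), `accelerate` installed
# for device_map="auto", and a plain-text prompt (no chat template is documented here).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

REPO = "moreh/MoMo-72B-lora-1.8.6-DPO"

tokenizer = AutoTokenizer.from_pretrained(REPO)
model = AutoModelForCausalLM.from_pretrained(
    REPO,
    torch_dtype=torch.float16,  # roughly halves memory versus the stored float32 shards
    device_map="auto",          # spread the shards across available GPUs, with CPU offload if needed
)

prompt = "Explain Direct Preference Optimization in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200, do_sample=False)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```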

Best Alternatives to MoMo 72B Lora 1.8.6 DPO

Best Alternatives | Context / RAM | Downloads | Likes
2 Pro Math | 128K / 141.9 GB | 9 | 0
Smaug 72B V0.1 | 32K / 144.5 GB | 9165 | 468
TW3 JRGL V2 | 32K / 79.7 GB | 1775 | 0
Le Triomphant ECE TW3 | 32K / 79.7 GB | 17774 | 4
ECE TW3 JRGL V5 | 32K / 159.6 GB | 9742 | 1
MoMo 72B Lora 1.8.7 DPO | 32K / 208.5 GB | 2354 | 68
Rhea 72B V0.5 | 32K / 144.5 GB | 9756 | 136
JuliusCesar 72B BeyonderV.0 | 32K / 74.2 GB | 5 | 0
Caigun Model 72B KGI | 32K / 144.6 GB | 5 | 0
MoMo 72B LoRA V1.4 | 32K / 208.5 GB | 1720 | 87
Note: a green score (e.g. "73.2") means that the model is better than moreh/MoMo-72B-lora-1.8.6-DPO.



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124