Name: Yi 34B 200K Rawrr1 LORA DPO Experimental R3
Author: adamo1139

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 is an open-source language model by adamo1139. Features: 34b LLM, VRAM: 0.5GB, License: apache-2.0, HF Score: 69.3, LLM Explorer Score: 0.14, Arc: 64.9, HellaSwag: 84.8, MMLU: 76, TruthfulQA: 45.4, WinoGrande: 83.1, GSM8K: 61.6.

4-bit Bitsandbytes Dataset:adamo1139/rawrr v1 Dpo Endpoints compatible Llama Lora Qlora Region:us Safetensors Unsloth

Model Card on HF 🤗: https://huggingface.co/adamo1139/Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Benchmarks

ARC: 64.85 vs 96.7 (so35)^-32.9%

HellaSwag: 84.77 vs 95.3 (gpt4)^-11%

MMLU: 76 vs 88.3 (so35)^-13.9%

TruthfulQA: 45.35 vs 59 (gpt4)^-23.1%

WinoGrande: 83.11 vs 87.5 (gpt4)^-5%

GSM8K: 61.64 vs 96.4 (so35)^-36.1%

LLME Score: 0.13649

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 (adamo1139/Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3)

🌟 Advertise your project 🚀

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Parameters and Internals

Model Type

QLoRA, DPO

Additional Notes

Trained with stronger parameters of lora_r 16, lora alpha 32 compared with previous ones with lora_r 4, lora_alpha 8.

Training Details

Data Sources:

adamo1139/rawrr_v1

Methodology:

QLoRA DPO training with Unsloth

Context Length:

500

LLM Name	Yi 34B 200K Rawrr1 LORA DPO Experimental R3
Repository 🤗	https://huggingface.co/adamo1139/Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3
Model Size	34b
Required VRAM	0.5 GB
Updated	2026-03-30
Maintainer	adamo1139
Model Files	0.5 GB
Model Architecture	AutoModelForCausalLM
License	apache-2.0
Model Max Length	200000
Is Biased	none
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
PEFT Type	LORA
LoRA Model	Yes
PEFT Target Modules	gate_proj\|q_proj\|o_proj\|down_proj\|v_proj\|up_proj\|k_proj
LoRA Alpha	32
LoRA Dropout	0
R Param	16

Best Alternatives to Yi 34B 200K Rawrr1 LORA DPO Experimental R3

Best Alternatives	Context / RAM	Downloads	Likes
Yi 34B Qlora E1	0K / 5.8 GB	765	0
Yi 34B 200K AEZAKMI V2 LoRA	0K / 0.5 GB	3	1
Yi 34B AEZAKMI V1 LoRA	0K / 0.5 GB	5	1
... 34B Spicyboros 2 2 Run3 QLoRA	0K / 0.5 GB	1	1
Yi 34B Spicyboros 3.1 2 LoRA	0K / 2 GB	0	1
Yi 34B Spicyboros 3.1 LoRA	0K / 2 GB	9	4
Limarpv3 Yi Llama 34B Lora	0K / 1 GB	10	10
Limarpv3 Yi Llama 34B Lora	0K / 1 GB	8	10
Yi 34B GiftedConvo	0K / 5.8 GB	2	2

Rank the Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 52392 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 by adamo1139

» All LLMs » adamo1139 » Yi 34B 200K Rawrr1 LORA DPO Experimental R3 URL Share it on

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Benchmarks

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Parameters and Internals

Best Alternatives to Yi 34B 200K Rawrr1 LORA DPO Experimental R3

Rank the Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Capabilities

What open-source LLMs or SLMs are you in search of? 52392 in total.