Yi 34B 200K Rawrr1 LORA DPO Experimental R3 by adamo1139

 ยป  All LLMs  ยป  adamo1139  ยป  Yi 34B 200K Rawrr1 LORA DPO Experimental R3   URL Share it on

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 is an open-source language model by adamo1139. Features: 34b LLM, VRAM: 0.5GB, License: apache-2.0, HF Score: 69.3, LLM Explorer Score: 0.14, Arc: 64.9, HellaSwag: 84.8, MMLU: 76, TruthfulQA: 45.4, WinoGrande: 83.1, GSM8K: 61.6.

  4-bit   Bitsandbytes   Dataset:adamo1139/rawrr v1   Dpo   Endpoints compatible   Llama   Lora   Qlora   Region:us   Safetensors   Unsloth

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Benchmarks

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 (adamo1139/Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3)
๐ŸŒŸ Advertise your project ๐Ÿš€

Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Parameters and Internals

Model Type 
QLoRA, DPO
Additional Notes 
Trained with stronger parameters of lora_r 16, lora alpha 32 compared with previous ones with lora_r 4, lora_alpha 8.
Training Details 
Data Sources:
adamo1139/rawrr_v1
Methodology:
QLoRA DPO training with Unsloth
Context Length:
500
LLM NameYi 34B 200K Rawrr1 LORA DPO Experimental R3
Repository ๐Ÿค—https://huggingface.co/adamo1139/Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3 
Model Size34b
Required VRAM0.5 GB
Updated2026-03-30
Maintaineradamo1139
Model Files  0.5 GB
Model ArchitectureAutoModelForCausalLM
Licenseapache-2.0
Model Max Length200000
Is Biasednone
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
PEFT TypeLORA
LoRA ModelYes
PEFT Target Modulesgate_proj|q_proj|o_proj|down_proj|v_proj|up_proj|k_proj
LoRA Alpha32
LoRA Dropout0
R Param16

Best Alternatives to Yi 34B 200K Rawrr1 LORA DPO Experimental R3

Best Alternatives
Context / RAM
Downloads
Likes
Yi 34B Qlora E10K / 5.8 GB7650
Yi 34B 200K AEZAKMI V2 LoRA0K / 0.5 GB31
Yi 34B AEZAKMI V1 LoRA0K / 0.5 GB51
... 34B Spicyboros 2 2 Run3 QLoRA0K / 0.5 GB11
Yi 34B Spicyboros 3.1 2 LoRA0K / 2 GB01
Yi 34B Spicyboros 3.1 LoRA0K / 2 GB94
Limarpv3 Yi Llama 34B Lora0K / 1 GB1010
Limarpv3 Yi Llama 34B Lora0K / 1 GB810
Yi 34B GiftedConvo0K / 5.8 GB22

Rank the Yi 34B 200K Rawrr1 LORA DPO Experimental R3 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 52392 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum โ€” our secure, self-hosted AI agent for server management.
Release v20260328a