Yi 34B Spicyboros 2 2 Run3 QLoRA by adamo1139

 ยป  All LLMs  ยป  adamo1139  ยป  Yi 34B Spicyboros 2 2 Run3 QLoRA   URL Share it on

  4-bit   Autotrain compatible   Bitsandbytes   Endpoints compatible   Generated from trainer   Llama   Lora   Region:us

Yi 34B Spicyboros 2 2 Run3 QLoRA Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yi 34B Spicyboros 2 2 Run3 QLoRA (adamo1139/Yi-34B-Spicyboros-2-2-run3-QLoRA)
๐ŸŒŸ Advertise your project ๐Ÿš€

Yi 34B Spicyboros 2 2 Run3 QLoRA Parameters and Internals

Model Type 
text generation
Use Cases 
Limitations:
Use is limited by Yi license
Additional Notes 
Thesaurus mode seems to be solved by lowering repetition penalty to 1.0. It can go into loops of repeating sentences occasionally.
Training Details 
Data Sources:
Jon Durbin's Spicyboros 2.2 dataset
Methodology:
Fine-tuning based on Yi-34B Llama
Training Time:
10 hours
Hardware Used:
single RTX 3090 Ti 24GB
Model Architecture:
Llama-fied Yi-34B
Input Output 
Input Format:
ChatML prompt format
LLM NameYi 34B Spicyboros 2 2 Run3 QLoRA
Repository ๐Ÿค—https://huggingface.co/adamo1139/Yi-34B-Spicyboros-2-2-run3-QLoRA 
Model Size34b
Required VRAM0.5 GB
Updated2025-08-18
Maintaineradamo1139
Model Files  0.5 GB
Model ArchitectureAutoModelForCausalLM
Model Max Length4096
Is Biasednone
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
PEFT TypeLORA
LoRA ModelYes
PEFT Target Modulesup_proj|q_proj|down_proj|v_proj|gate_proj|o_proj|k_proj
LoRA Alpha32
LoRA Dropout0.05
R Param16

Best Alternatives to Yi 34B Spicyboros 2 2 Run3 QLoRA

Best Alternatives
Context / RAM
Downloads
Likes
...awrr1 LORA DPO Experimental R30K / 0.5 GB14251
Yi 34B Qlora E10K / 5.8 GB17960
Yi 34B 200K AEZAKMI V2 LoRA0K / 0.5 GB11
Yi 34B AEZAKMI V1 LoRA0K / 0.5 GB31
Yi 34B Spicyboros 3.1 2 LoRA0K / 2 GB31
Yi 34B Spicyboros 3.1 LoRA0K / 2 GB34
Limarpv3 Yi Llama 34B Lora0K / 1 GB110
Yi 34B GiftedConvo0K / 5.8 GB32
Note: green Score (e.g. "73.2") means that the model is better than adamo1139/Yi-34B-Spicyboros-2-2-run3-QLoRA.

Rank the Yi 34B Spicyboros 2 2 Run3 QLoRA Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50729 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124