Yi 34B 200K DARE Megamerge V8 by brucethemoose

Tags: Merged Model, Arxiv:2306.01708, Arxiv:2311.03099, Autotrain compatible, En, Endpoints compatible, Llama, Model-index, Region:us, Safetensors, Sharded, Tensorflow, Yi

Yi 34B 200K DARE Megamerge V8 Benchmarks

Yi 34B 200K DARE Megamerge V8 (brucethemoose/Yi-34B-200K-DARE-megamerge-v8)

Yi 34B 200K DARE Megamerge V8 Parameters and Internals

Model Type 
text-generation
Additional Notes 
Merged with the DARE TIES method; the model supports a context of up to 200,000 tokens. The merge combines multiple Yi models for improved performance.
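The note above names DARE TIES but does not describe it. As a rough illustration only, the sketch below shows the drop-and-rescale step described in the DARE paper (arXiv:2311.03099): each delta between a fine-tuned weight and the base weight is dropped with some probability and the survivors are rescaled. The function names `dare_delta` and `apply_deltas`, the drop rate, and the toy weight lists are hypothetical, and the TIES sign-election step is deliberately omitted, so this is not the recipe actually used for this model.

```python
import random


def dare_delta(base, finetuned, drop_rate, rng):
    """Return the sparsified, rescaled delta (finetuned - base).

    Each delta entry is dropped with probability `drop_rate`; the
    remaining entries are multiplied by 1 / (1 - drop_rate).
    """
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    delta = []
    for b, f in zip(base, finetuned):
        keep = rng.random() >= drop_rate
        delta.append((f - b) * scale if keep else 0.0)
    return delta


def apply_deltas(base, deltas, weights):
    """Add weighted deltas back onto the base weights (plain linear merge)."""
    merged = list(base)
    for delta, w in zip(deltas, weights):
        for i, d in enumerate(delta):
            merged[i] += w * d
    return merged


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the toy run is repeatable
    base = [1.0, 2.0, 3.0, 4.0]
    tuned = [1.5, 2.5, 2.0, 4.0]
    delta = dare_delta(base, tuned, drop_rate=0.5, rng=rng)
    print(apply_deltas(base, [delta], [1.0]))
```

A full DARE TIES merge would additionally resolve sign conflicts between the deltas of different models before summing them; tools that implement this typically operate on whole tensors rather than Python lists.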
Input Output 
Input Format:
Orca-Vicuna template
Accepted Modalities:
text
Output Format:
text
Performance Tips:
Run at a lower temperature with 0.1 or higher MinP, a little repetition penalty, possibly mirostat with low tau.
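To make the input format and the MinP tip concrete, here is a minimal sketch. It assumes the Orca-Vicuna template has the common `SYSTEM: / USER: / ASSISTANT:` layout (check the model card before relying on it) and uses the usual definition of MinP: keep only tokens whose probability is at least `min_p` times the probability of the most likely token, then renormalize. The helper names and the toy distribution are hypothetical.

```python
def build_prompt(system, user):
    """Format a single-turn prompt (assumed Orca-Vicuna layout)."""
    return f"SYSTEM: {system}\nUSER: {user}\nASSISTANT:"


def min_p_filter(probs, min_p=0.1):
    """Keep tokens with p >= min_p * max(p), then renormalize to sum to 1."""
    threshold = min_p * max(probs.values())
    kept = {tok: p for tok, p in probs.items() if p >= threshold}
    total = sum(kept.values())
    return {tok: p / total for tok, p in kept.items()}


if __name__ == "__main__":
    print(build_prompt("You are a helpful assistant.", "Summarize this text."))
    toy = {"a": 0.6, "b": 0.3, "c": 0.05, "d": 0.05}
    # With min_p=0.1 the cutoff is about 0.06, so "c" and "d" are removed.
    print(min_p_filter(toy, min_p=0.1))
```

In practice the sampler settings are passed to whatever inference backend is in use; the filter above only illustrates what a MinP of 0.1 does to the token distribution.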
LLM Name: Yi 34B 200K DARE Megamerge V8
Repository: https://huggingface.co/brucethemoose/Yi-34B-200K-DARE-megamerge-v8
Merged Model: Yes
Model Size: 34b
Required VRAM: 68.8 GB
Updated: 2025-09-15
Maintainer: brucethemoose
Model Type: llama
Model Files: 9.8 GB: 1-of-7   9.8 GB: 2-of-7   9.8 GB: 3-of-7   10.0 GB: 4-of-7   9.8 GB: 5-of-7   9.8 GB: 6-of-7   9.8 GB: 7-of-7
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: other
Context Length: 200000
Model Max Length: 200000
Transformers Version: 4.35.2
Tokenizer Class: LlamaTokenizer
Padding Token: <unk>
Vocabulary Size: 64002
Torch Data Type: bfloat16
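As a quick sanity check on the figures above, the sketch below sums the seven listed shard sizes and compares the result with a back-of-the-envelope estimate of 2 bytes per parameter for bfloat16. The helper name `weight_memory_gb` is hypothetical and the round figure of 34 billion parameters is an assumption taken from the "34b" label. Both numbers cover the weights only; the KV cache and activations need additional memory, especially at long context lengths.

```python
# Shard sizes in GB, as listed under "Model Files".
SHARD_SIZES_GB = [9.8, 9.8, 9.8, 10.0, 9.8, 9.8, 9.8]

# Bytes needed to store one parameter in each data type.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2}


def weight_memory_gb(n_params, dtype):
    """Estimate weight storage in GB (1 GB = 1e9 bytes) for a given dtype."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    total_files = round(sum(SHARD_SIZES_GB), 1)
    print(f"Sum of shards: {total_files} GB")
    # Assumption: "34b" is treated as exactly 34e9 parameters.
    print(f"bfloat16 estimate: {weight_memory_gb(34e9, 'bfloat16')} GB")
```

The shard total matches the listed Required VRAM of 68.8 GB, which suggests that figure reflects the size of the weight files rather than a measured runtime footprint.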

Quantized Models of the Yi 34B 200K DARE Megamerge V8

Model
Likes
Downloads
VRAM
...4B 200K DARE Megamerge V8 GGUF142479 GB
...4B 200K DARE Megamerge V8 GPTQ3918 GB
...34B 200K DARE Megamerge V8 AWQ2519 GB

Best Alternatives to Yi 34B 200K DARE Megamerge V8

Best Alternatives
Context / RAM
Downloads
Likes
34B Beta   195K / 69.2 GB967863
Bagel Hermes 34B Slerp   195K / 68.9 GB96901
Smaug 34B V0.1   195K / 69.2 GB969962
Yi 34B 200K   195K / 68.9 GB13047321
Casual Magnum 34B   195K / 68.8 GB71
Bagel 34B V0.2   195K / 68.7 GB399841
Yi 34B 200K AEZAKMI V2   195K / 69.2 GB179912
Smaug 34B V0.1 ExPO   195K / 69.2 GB91160
Faro Yi 34B   195K / 69.2 GB97066
Bagel DPO 34B V0.5   195K / 68.7 GB911717

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124