Deepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2 by LoneStriker

 »  All LLMs  »  LoneStriker  »  Deepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2   URL Share it on

  Autotrain compatible   Dataset:unalignment/spicy-3.1   Endpoints compatible   Exl2   Llama   Pytorch   Quantized   Region:us   Safetensors   Sharded   Tensorflow

Deepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Deepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2 Parameters and Internals

Model Type 
text generation
Additional Notes 
DeepSeek LLM models are open source for the research community. Text completion example provided in Python.
Supported Languages 
English (High proficiency), Chinese (High proficiency)
Training Details 
Data Sources:
unalignment/spicy-3.1
Data Volume:
2 trillion tokens
Model Architecture:
Grouped-Query Attention
LLM NameDeepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2
Repository 🤗https://huggingface.co/LoneStriker/deepseek-llm-67b-Spicy-3.1-1-5.0bpw-h6-exl2 
Model Size67b
Required VRAM43.7 GB
Updated2025-08-16
MaintainerLoneStriker
Model Typellama
Model Files  8.6 GB: 1-of-6   8.6 GB: 2-of-6   8.6 GB: 3-of-6   8.6 GB: 4-of-6   8.6 GB: 5-of-6   0.7 GB: 6-of-6
Quantization Typeexl2
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length4096
Model Max Length4096
Transformers Version4.35.0
Tokenizer ClassLlamaTokenizer
Padding Token<|end▁of▁sentence|>
Vocabulary Size102400
Torch Data Typefloat16

Best Alternatives to Deepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2

Best Alternatives
Context / RAM
Downloads
Likes
...ek Llm 67B Chat 2.4bpw H6 EXL24K / 22.2 GB52
...k Llm 67B Chat 2.65bpw H6 EXL24K / 24.2 GB51
...ek Llm 67B Chat 3.0bpw H6 EXL24K / 27.1 GB51
Deepseek Llm 67B Chat4K / 135 GB3051203
...penbuddy Deepseek 67B V18.1 4K4K / 135 GB92171
Deepseek Llm 67B Base4K / 135 GB10891125
...penbuddy Deepseek 67B V15 Base4K / 135 GB19440
Openbuddy Deepseek 67B V15.24K / 135 GB194810
Openbuddy Deepseek 67B V15.14K / 135 GB19531
...ddy Deepseek 67B V18.1 4K Gptq4K / 37.6 GB32
Note: green Score (e.g. "73.2") means that the model is better than LoneStriker/deepseek-llm-67b-Spicy-3.1-1-5.0bpw-h6-exl2.

Rank the Deepseek Llm 67B Spicy 3.1 1 5.0bpw H6 EXL2 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50723 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124