Deepseek Llm 67B Base by deepseek-ai

 »  All LLMs  »  deepseek-ai  »  Deepseek Llm 67B Base   URL Share it on

  Autotrain compatible   Endpoints compatible   Llama   Pytorch   Region:us   Sharded

Deepseek Llm 67B Base Benchmarks

Deepseek Llm 67B Base Parameters and Internals

Model Type 
text generation
Additional Notes 
DeepSeek LLM is open source for the research community.
Supported Languages 
English (Proficient), Chinese (Proficient)
Training Details 
Data Volume:
2 trillion tokens
Model Architecture:
Grouped-Query Attention
LLM NameDeepseek Llm 67B Base
Repository 🤗https://huggingface.co/deepseek-ai/deepseek-llm-67b-base 
Model Size67b
Required VRAM135 GB
Updated2025-08-16
Maintainerdeepseek-ai
Model Typellama
Model Files  10.0 GB: 1-of-14   10.0 GB: 2-of-14   9.7 GB: 3-of-14   9.7 GB: 4-of-14   9.7 GB: 5-of-14   9.7 GB: 6-of-14   9.7 GB: 7-of-14   9.7 GB: 8-of-14   9.7 GB: 9-of-14   9.7 GB: 10-of-14   9.7 GB: 11-of-14   9.7 GB: 12-of-14   9.7 GB: 13-of-14   8.3 GB: 14-of-14
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length4096
Model Max Length4096
Transformers Version4.33.1
Tokenizer ClassLlamaTokenizerFast
Beginning of Sentence Token<|begin▁of▁sentence|>
End of Sentence Token<|end▁of▁sentence|>
Vocabulary Size102400
Torch Data Typebfloat16

Quantized Models of the Deepseek Llm 67B Base

Model
Likes
Downloads
VRAM
Deepseek Llm 67B Base GGUF327328 GB
Deepseek Llm 67B Base AWQ23637 GB
Deepseek Llm 67B Base GPTQ11136 GB

Best Alternatives to Deepseek Llm 67B Base

Best Alternatives
Context / RAM
Downloads
Likes
Deepseek Llm 67B Chat4K / 135 GB3051203
...penbuddy Deepseek 67B V18.1 4K4K / 135 GB92171
...penbuddy Deepseek 67B V15 Base4K / 135 GB19440
Openbuddy Deepseek 67B V15.24K / 135 GB194810
Openbuddy Deepseek 67B V15.14K / 135 GB19531
...penbuddy Deepseek 67B V15.3 4K4K / 135 GB61
Deepmoney 67B Chat4K / 135 GB1123
DeepSeek 67B MMIQC4K / 135 GB51
...67B Spicy 3.1 1 5.0bpw H6 EXL24K / 43.7 GB61
...ek Llm 67B Chat 2.4bpw H6 EXL24K / 22.2 GB52
Note: green Score (e.g. "73.2") means that the model is better than deepseek-ai/deepseek-llm-67b-base.

Rank the Deepseek Llm 67B Base Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50705 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124