Deepseek Llm 67B Base by deepseek-ai

 »  All LLMs  »  deepseek-ai  »  Deepseek Llm 67B Base   URL Share it on

  Autotrain compatible   Endpoints compatible   Llama   Pytorch   Region:us   Sharded

Deepseek Llm 67B Base Benchmarks

Deepseek Llm 67B Base Parameters and Internals

Model Type 
text generation
Additional Notes 
DeepSeek LLM is open source for the research community.
Supported Languages 
English (Proficient), Chinese (Proficient)
Training Details 
Data Volume:
2 trillion tokens
Model Architecture:
Grouped-Query Attention
LLM NameDeepseek Llm 67B Base
Repository 🤗https://huggingface.co/deepseek-ai/deepseek-llm-67b-base 
Model Size67b
Required VRAM135 GB
Updated2025-08-19
Maintainerdeepseek-ai
Model Typellama
Model Files  10.0 GB: 1-of-14   10.0 GB: 2-of-14   9.7 GB: 3-of-14   9.7 GB: 4-of-14   9.7 GB: 5-of-14   9.7 GB: 6-of-14   9.7 GB: 7-of-14   9.7 GB: 8-of-14   9.7 GB: 9-of-14   9.7 GB: 10-of-14   9.7 GB: 11-of-14   9.7 GB: 12-of-14   9.7 GB: 13-of-14   8.3 GB: 14-of-14
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length4096
Model Max Length4096
Transformers Version4.33.1
Tokenizer ClassLlamaTokenizerFast
Beginning of Sentence Token<|begin▁of▁sentence|>
End of Sentence Token<|end▁of▁sentence|>
Vocabulary Size102400
Torch Data Typebfloat16

Quantized Models of the Deepseek Llm 67B Base

Model
Likes
Downloads
VRAM
Deepseek Llm 67B Base GGUF324028 GB
Deepseek Llm 67B Base AWQ23137 GB
Deepseek Llm 67B Base GPTQ1936 GB

Best Alternatives to Deepseek Llm 67B Base

Best Alternatives
Context / RAM
Downloads
Likes
Deepseek Llm 67B Chat4K / 135 GB2797203
...penbuddy Deepseek 67B V18.1 4K4K / 135 GB85021
...penbuddy Deepseek 67B V15 Base4K / 135 GB18110
Openbuddy Deepseek 67B V15.24K / 135 GB181310
Openbuddy Deepseek 67B V15.14K / 135 GB18171
...penbuddy Deepseek 67B V15.3 4K4K / 135 GB61
Deepmoney 67B Chat4K / 135 GB923
DeepSeek 67B MMIQC4K / 135 GB51
...67B Spicy 3.1 1 5.0bpw H6 EXL24K / 43.7 GB61
...ek Llm 67B Chat 2.4bpw H6 EXL24K / 22.2 GB62
Note: green Score (e.g. "73.2") means that the model is better than deepseek-ai/deepseek-llm-67b-base.

Rank the Deepseek Llm 67B Base Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50751 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124