Deepseek Coder 6.7B Base is an open-source language model by deepseek-ai. Features: 6.7b LLM, VRAM: 13.5GB, Context: 16K, License: other, Code Generating, HF Score: 40.9, LLM Explorer Score: 0.21, Arc: 37, HellaSwag: 53.5, MMLU: 38.4, TruthfulQA: 40.3, WinoGrande: 58.1, GSM8K: 18, HumanEval: 45.8.
| Model Type |
| ||||||||||||
| Use Cases |
| ||||||||||||
| Additional Notes |
| ||||||||||||
| Supported Languages |
| ||||||||||||
| Training Details |
|
| LLM Name | Deepseek Coder 6.7B Base |
| Repository 🤗 | https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-base |
| Model Size | 6.7b |
| Required VRAM | 13.5 GB |
| Updated | 2026-03-21 |
| Maintainer | deepseek-ai |
| Model Type | llama |
| Model Files | |
| Generates Code | Yes |
| Model Architecture | LlamaForCausalLM |
| License | other |
| Context Length | 16384 |
| Model Max Length | 16384 |
| Transformers Version | 4.34.1 |
| Tokenizer Class | LlamaTokenizerFast |
| Beginning of Sentence Token | <|begin▁of▁sentence|> |
| End of Sentence Token | <|end▁of▁sentence|> |
| Vocabulary Size | 32256 |
| Torch Data Type | bfloat16 |
Model |
Likes |
Downloads |
VRAM |
|---|---|---|---|
| Deepseek Coder 6.7B Base GGUF | 22 | 2277 | 2 GB |
| Deepseek Coder 6.7B Base GPTQ | 5 | 106 | 3 GB |
| Deepseek Coder 6.7B Base AWQ | 10 | 36 | 3 GB |
| Deepseek Coder 6.7B Base AWQ | 0 | 0 | 3 GB |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| Speechless Coder Ds 6.7B | 16K / 13.5 GB | 1110 | 7 |
| ...s Coder6.7b Reflct Adamw Iter1 | 16K / 13.5 GB | 475 | 0 |
| ...Coder6.7b Reflct Rmsprop Iter1 | 16K / 13.5 GB | 95 | 0 |
| ...Coder6.7b Reflct Rmsprop Iter1 | 16K / 13.5 GB | 110 | 0 |
| ...r6.7b Pos Reflct Rmsprop Iter1 | 16K / 13.5 GB | 87 | 0 |
| ...ir4 Ds Coder6.7b Rmsprop Iter1 | 16K / 13.5 GB | 43 | 0 |
| ...r6.7b Pos Reflct Rmsprop Iter1 | 16K / 13.5 GB | 90 | 0 |
| ...Coder6.7b Reflct Rmsprop Iter1 | 16K / 13.5 GB | 62 | 0 |
| Ds Coder6.7b Rmsprop Iter1 | 16K / 13.5 GB | 67 | 0 |
| Code 290K 6.7B Instruct | 16K / 13.5 GB | 129 | 6 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟