| Model Type | 
 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Training Details | 
 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Input Output | 
 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Release Notes | 
 | 
| LLM Name | CodeFuse DeepSeek 33B 4bits | 
| Repository 🤗 | https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B-4bits | 
| Base Model(s) | |
| Model Size | 33b | 
| Required VRAM | 18.7 GB | 
| Updated | 2025-09-23 | 
| Maintainer | codefuse-ai | 
| Model Type | llama | 
| Model Files | |
| Supported Languages | en zh | 
| GPTQ Quantization | Yes | 
| Quantization Type | gptq|4bit | 
| Generates Code | Yes | 
| Model Architecture | LlamaForCausalLM | 
| License | other | 
| Context Length | 16384 | 
| Model Max Length | 16384 | 
| Transformers Version | 4.33.2 | 
| Tokenizer Class | LlamaTokenizerFast | 
| Beginning of Sentence Token | <|begin▁of▁sentence|> | 
| End of Sentence Token | <|end▁of▁sentence|> | 
| Padding Token | <|end▁of▁sentence|> | 
| Vocabulary Size | 32256 | 
| Torch Data Type | bfloat16 | 
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| ...epseek Coder 33B Instruct GPTQ | 16K / 17.4 GB | 1381 | 25 | 
| Everyone Coder 33B Base GPTQ | 16K / 17.4 GB | 8 | 3 | 
| Deepseek Coder 33B Base GPTQ | 16K / 17.4 GB | 24 | 2 | 
| Vicuna 33B Coder GPTQ | 2K / 16.9 GB | 9 | 1 | 
| ...erpreter DS 33B 4.0bpw H6 EXL2 | 16K / 17.1 GB | 7 | 4 | 
| ...erpreter DS 33B 6.0bpw H6 EXL2 | 16K / 25.3 GB | 7 | 1 | 
| ...rpreter DS 33B 4.65bpw H6 EXL2 | 16K / 19.8 GB | 2 | 2 | 
| ...erpreter DS 33B 5.0bpw H6 EXL2 | 16K / 21.2 GB | 6 | 1 | 
| ...erpreter DS 33B 8.0bpw H8 EXL2 | 16K / 33.5 GB | 0 | 1 | 
| ...der 33B V2 Base 8.0bpw H8 EXL2 | 16K / 33.5 GB | 7 | 1 | 
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟