| LLM Name | DeepSeek R1 0528 Qwen3 8B | 
| Repository 🤗 | https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | 
| Model Size | 8b | 
| Required VRAM | 16.4 GB | 
| Updated | 2025-09-23 | 
| Maintainer | deepseek-ai | 
| Model Type | qwen3 | 
| Model Files | |
| Model Architecture | Qwen3ForCausalLM | 
| License | mit | 
| Context Length | 131072 | 
| Model Max Length | 131072 | 
| Transformers Version | 4.51.0 | 
| Tokenizer Class | LlamaTokenizerFast | 
| Beginning of Sentence Token | <|begin▁of▁sentence|> | 
| End of Sentence Token | <|end▁of▁sentence|> | 
| Vocabulary Size | 151936 | 
| Torch Data Type | bfloat16 | 
| Model | Likes | Downloads | VRAM | 
|---|---|---|---|
| DeepSeek R1 0528 Qwen3 8B GGUF | 315 | 147745 | 2 GB | 
| DeepSeek R1 0528 Qwen3 8B GGUF | 5 | 93017 | 3 GB | 
| ...0528 Qwen3 8B Unsloth Bnb 4bit | 11 | 16071 | 7 GB | 
| ...Seek R1 Qwen3 0528 8B 4bit AWQ | 4 | 1199 | 4 GB | 
| ...Seek R1 0528 Qwen3 8B Bnb 4bit | 10 | 1967 | 6 GB | 
| DeepSeek R1 0528 Qwen3 8B 4bit | 4 | 1302 | 4 GB | 
| ...Seek R1 0528 Qwen3 8B 4bit DWQ | 8 | 1102 | 4 GB | 
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| ...n3 8B 320K Context 10X Massive | 320K / 16.4 GB | 20 | 0 | 
| ...r Of Horror Jan V1 256K Ctx 8B | 256K / 16.1 GB | 33 | 3 | 
| ... BIG Jan Horror V1 256K Ctx 8B | 256K / 16.1 GB | 44 | 0 | 
| Qwen3 8B 256K Context 8X Grand | 256K / 16.4 GB | 95 | 0 | 
| ...wen3 8B 192K Context 6X Larger | 192K / 16.4 GB | 55 | 0 | 
| DeepSeek R1 0528 Qwen3 8B | 128K / 16.4 GB | 12490 | 16 | 
| ...1 0528 Qwen3 8B Abliterated V1 | 128K / 16.4 GB | 1085 | 29 | 
| ...1 Qwen3 8B ArliAI RpR V4 Small | 128K / 16.4 GB | 1112 | 17 | 
| ...8 Qwen3 8B Abliterated V1 Bf16 | 128K / 16.3 GB | 399 | 1 | 
| Qwen3 EZO 8B YOYO Karcher 128K | 128K / 16.4 GB | 24 | 1 | 
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟