DeepSeek R1 0528 Qwen3 8B GGUF by MaziyarPanahi

 ยป  All LLMs  ยป  MaziyarPanahi  ยป  DeepSeek R1 0528 Qwen3 8B GGUF   URL Share it on

  2-bit   3-bit   4-bit   5-bit   6-bit   8-bit Base model:deepseek-ai/deepsee... Base model:quantized:deepseek-...   Conversational   Gguf   Mistral   Quantized   Region:us

DeepSeek R1 0528 Qwen3 8B GGUF Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
๐ŸŒŸ Advertise your project ๐Ÿš€

DeepSeek R1 0528 Qwen3 8B GGUF Parameters and Internals

LLM NameDeepSeek R1 0528 Qwen3 8B GGUF
Repository ๐Ÿค—https://huggingface.co/MaziyarPanahi/DeepSeek-R1-0528-Qwen3-8B-GGUF 
Model NameDeepSeek-R1-0528-Qwen3-8B-GGUF
Model Creatordeepseek-ai
Base Model(s)  DeepSeek R1 0528 Qwen3 8B   deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
Model Size8b
Required VRAM3.3 GB
Updated2025-06-09
MaintainerMaziyarPanahi
Model Typemistral
Model Files  3.3 GB   4.4 GB   4.1 GB   5.0 GB   5.8 GB   6.7 GB   16.4 GB
GGUF QuantizationYes
Quantization Typegguf
DeepSeek R1 0528 Qwen3 8B GGUF (MaziyarPanahi/DeepSeek-R1-0528-Qwen3-8B-GGUF)

Rank the DeepSeek R1 0528 Qwen3 8B GGUF Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 48046 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124