| LLM Name | Deepseek Qwen2.5 7B Redistil |
| Repository 🤗 | https://huggingface.co/jan-hq/Deepseek-Qwen2.5-7B-Redistil |
| Model Size | 7B |
| Required VRAM | 15.2 GB |
| Updated | 2025-09-23 |
| Maintainer | jan-hq |
| Model Type | qwen2 |
| GGUF Quantization | Yes |
| Quantization Type | gguf |
| Model Architecture | Qwen2ForCausalLM |
| Context Length | 131072 |
| Model Max Length | 131072 |
| Transformers Version | 4.48.2 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | <|end▁of▁sentence|> |
| Vocabulary Size | 152064 |
| Torch Data Type | bfloat16 |
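
The Required VRAM figure above is consistent with a simple weights-only estimate: parameter count times bytes per parameter. The sketch below illustrates that arithmetic. The parameter count of roughly 7.6 billion is an assumption (the table only says "7B"), the helper name `estimate_weight_memory_gb` is hypothetical, and the estimate ignores activations, the KV cache, and framework overhead, so real usage at long context lengths will be higher.

```python
def estimate_weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed to hold the weights alone, in decimal GB."""
    return num_params * bytes_per_param / 1e9


# Assumption: about 7.6 billion parameters; the table lists only "7B".
ASSUMED_PARAM_COUNT = 7.6e9

# bfloat16 (the Torch Data Type in the table) stores each value in 2 bytes.
BF16_BYTES = 2

weights_gb = estimate_weight_memory_gb(ASSUMED_PARAM_COUNT, BF16_BYTES)
print(f"Estimated weight memory: {weights_gb:.1f} GB")
```

Under these assumptions the result is about 15.2 GB, in line with the listed requirement. The same helper with 1 byte per parameter gives a rough figure for an 8-bit quantization, although actual GGUF file sizes depend on the quantization scheme.
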

| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...lling ReSearch Gguf Q8 0 Codex | 128K / 8.1 GB | 79 | 2 |
| Pathumma Llm Text 1.0.0 | 128K / 30.5 GB | 1616 | 10 |
| SvelteCodeQwen1.5 7B Chat | 64K / 14.5 GB | 460 | 0 |
| CodeQwen1.5 7B Chat GGUF | 64K / 3 GB | 282 | 2 |
| Qwen2 Cantonese 7B Instruct | 32K / 15.4 GB | 83 | 3 |
| Openthaigpt1.5 7B Instruct | 32K / 15.2 GB | 3638 | 16 |
| Qwen 2.5 7B Threatflux | 32K / 15.5 GB | 27 | 6 |
| ...der 7B Instruct Abliterated V1 | 32K / 15.2 GB | 3 | 2 |
| Qwen2 7B Instruct GGUF | 32K / 3 GB | 1233 | 1 |
| Qwen2 7B Instruct GGUF | 32K / 3 GB | 17 | 0 |