DeepSeek Qwen2.5 14B DeepThinker V2 by Vijayendra

  Adapter Base model:adapter:deepseek-ai... Base model:deepseek-ai/deepsee...   Conversational   En   Finetuned   Lora   Peft   Region:us   Safetensors

DeepSeek Qwen2.5 14B DeepThinker V2 Parameters and Internals

LLM Name: DeepSeek Qwen2.5 14B DeepThinker V2
Repository: https://huggingface.co/Vijayendra/DeepSeek-Qwen2.5-14B-DeepThinker-v2
Base Model(s): DeepSeek R1 Distill Qwen 14B (deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
Model Size: 14B
Required VRAM: 0.3 GB
Updated: 2025-07-29
Maintainer: Vijayendra
Model Files: 0.3 GB
Supported Languages: en
Model Architecture: Adapter
License: mit
Model Max Length: 16384
Is Biased: none
Tokenizer Class: LlamaTokenizer
Padding Token: <|end▁of▁sentence|>
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: self_attn.o_proj, mlp.down_proj, self_attn.q_proj, self_attn.k_proj, self_attn.v_proj, mlp.gate_proj, mlp.up_proj
LoRA Alpha: 16
LoRA Dropout: 0.05
R Param: 16
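
The LoRA settings listed above (rank 16, alpha 16, seven target modules) are enough for a rough check of the adapter's size and for a loading sketch. The Python below is a minimal illustration, not the maintainer's code. The layer dimensions (`HIDDEN`, `INTERMEDIATE`, `KV_DIM`, `NUM_LAYERS`) are assumptions about the base model that do not appear on this page, and the helper names (`lora_param_count`, `lora_scaling`, `load_adapter`) are hypothetical. The loading function assumes the `transformers`, `peft`, and `accelerate` packages are installed and that enough memory is available for the 14B base model.

```python
# Sketch: size estimate and loading helper for the LoRA adapter described above.
# Values marked ASSUMED are not taken from this page.

LORA_R = 16       # "R Param" from the listing
LORA_ALPHA = 16   # "LoRA Alpha" from the listing

# ASSUMED base-model geometry (Qwen2.5-14B style); verify against the base config.
HIDDEN = 5120
INTERMEDIATE = 13824
KV_DIM = 1024     # ASSUMED: 8 key/value heads x 128 head dimension
NUM_LAYERS = 48

# (in_features, out_features) for each entry in "PEFT Target Modules".
TARGET_SHAPES = {
    "self_attn.q_proj": (HIDDEN, HIDDEN),
    "self_attn.k_proj": (HIDDEN, KV_DIM),
    "self_attn.v_proj": (HIDDEN, KV_DIM),
    "self_attn.o_proj": (HIDDEN, HIDDEN),
    "mlp.gate_proj": (HIDDEN, INTERMEDIATE),
    "mlp.up_proj": (HIDDEN, INTERMEDIATE),
    "mlp.down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_param_count(r=LORA_R):
    """Each adapted linear layer adds A (r x in) and B (out x r) matrices."""
    per_layer = sum(r * (n_in + n_out) for n_in, n_out in TARGET_SHAPES.values())
    return per_layer * NUM_LAYERS


def lora_scaling(alpha=LORA_ALPHA, r=LORA_R):
    """Standard LoRA scales the low-rank update by alpha / r."""
    return alpha / r


def load_adapter(
    base_id="deepseek-ai/DeepSeek-R1-Distill-Qwen-14B",
    adapter_id="Vijayendra/DeepSeek-Qwen2.5-14B-DeepThinker-v2",
):
    """Load the base model, then attach the adapter weights on top of it."""
    # Imported lazily so the size estimate above runs without these packages.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_id, torch_dtype="auto", device_map="auto"
    )
    model = PeftModel.from_pretrained(base, adapter_id)
    return tokenizer, model


if __name__ == "__main__":
    n = lora_param_count()
    print(f"adapter parameters: {n:,}")
    print(f"size if stored as 32-bit floats: {n * 4 / 1e9:.2f} GB")
    print(f"LoRA scaling factor: {lora_scaling()}")
```

Under these assumed dimensions the adapter has roughly 69 million parameters, which at 4 bytes each is close to the 0.3 GB listed for the model files. That figure covers only the adapter: running the model also requires memory for the 14B base model, which `load_adapter` downloads and loads first.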

Best Alternatives to DeepSeek Qwen2.5 14B DeepThinker V2

Best Alternatives   Context / RAM   Downloads   Likes
Qwen Story Test Qlora0K / 1.1 GB331
Vaivcon20K / 0.2 GB50
Qwen Nampdn Ai Tiny Textbooks0K / 0.2 GB01
Careqwen 14B Chat Sft Multi0K / 0 GB51

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124