LLM Name | TinyDeepSeek 0.5B Base |
Repository 🤗 | https://huggingface.co/FreedomIntelligence/TinyDeepSeek-0.5B-base |
Model Size | 0.5b |
Required VRAM | 1.1 GB |
Updated | 2025-08-17 |
Maintainer | FreedomIntelligence |
Model Type | deepseek_v3 |
Model Files | |
Model Architecture | TinyDeepseekV3ForCausalLM |
License | apache-2.0 |
Context Length | 163840 |
Model Max Length | 163840 |
Transformers Version | 4.48.3 |
Tokenizer Class | LlamaTokenizerFast |
Beginning of Sentence Token | <|begin▁of▁sentence|> |
End of Sentence Token | <|end▁of▁sentence|> |
Vocabulary Size | 129280 |
Torch Data Type | bfloat16 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟