Qwen2.5 0.5B OpenHermes2.5 is an open-source language model by artificialguybr. Features: 0.5b LLM, VRAM: 1.3GB, Context: 32K, License: apache-2.0, LLM Explorer Score: 0.16.
Qwen2.5 0.5B OpenHermes2.5 Parameters and Internals
Model Type
Causal Language Model
Use Cases
Primary Use Cases:
Research in NLP tasks, Text generation, Language understanding, Conversational AI
Limitations:
Not recommended for direct use in conversations without further fine-tuning, Performance may vary across languages and domains, Potential biases in the training data
Additional Notes
The base Qwen2.5-0.5B model has enhanced capabilities in coding, mathematics, and understanding structured data. It supports long-context processing up to 128K tokens.
Supported Languages
Multilingual support (>29)
Training Details
Data Volume:
1 million samples
Context Length:
32768
Model Architecture:
Transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias, and tied word embeddings
Note: green Score (e.g. "73.2") means that the model is better than artificialguybr/Qwen2.5-0.5B-OpenHermes2.5.
Rank the Qwen2.5 0.5B OpenHermes2.5 Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53493 in total.