Qwen2.5 3B Instruct is an open-source language model by unsloth. Features: 3b LLM, VRAM: 6.2GB, Context: 32K, License: other, Instruction-Based, LLM Explorer Score: 0.22.
Qwen2.5 series includes improvements in knowledge, instruction following, and multilingual support across 29 languages. It features long-context support up to 128K tokens for generating up to 8K tokens. The architecture utilizes RoPE, SwiGLU, RMSNorm, and tied word embeddings.
Supported Languages
en (native proficiency), zh (native proficiency), fr (native proficiency), es (native proficiency), pt (native proficiency), de (native proficiency), it (native proficiency), ru (native proficiency), ja (native proficiency), ko (native proficiency), vi (native proficiency), th (native proficiency), ar (native proficiency)
Training Details
Context Length:
32768
Model Architecture:
Transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings
Note: green Score (e.g. "73.2") means that the model is better than unsloth/Qwen2.5-3B-Instruct.
Rank the Qwen2.5 3B Instruct Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53089 in total.