DPO Model Test1 is an open-source language model by kimdeokgi. Features: 21.4b LLM, VRAM: 43GB, Context: 32K, License: apache-2.0, LLM Explorer Score: 0.14, Arc: 65.7, HellaSwag: 83, MMLU: 67.4, GSM8K: 58.
State-of-the-art instruction fine-tuning methods including direct preference optimization (DPO). Models were linearly merged to boost performance after DPO training.
Note: green Score (e.g. "73.2") means that the model is better than kimdeokgi/dpo_model_test1.
Rank the DPO Model Test1 Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53999 in total.