Pre-trained on a large-scale dataset, followed by supervised fine-tuning and preference-aware training with multi-turn conversational data and plugin augmentation.
Context Length:
2048
Hardware Used:
A100 GPU, NVIDIA 3090 GPU
Model Architecture:
Pre-trained on multi-turn conversational data and plugin-augmented data with supervised and preference-aware fine-tuning.
Safety Evaluation
Methodologies:
red-teaming
Risk Categories:
misinformation, bias
Ethical Considerations:
Acknowledges possible biased, misleading, or incorrect information generation.
Responsible Ai Considerations
Fairness:
Training includes a diverse set of conversation data to address bias.
Transparency:
Model details and code are open-sourced.
Accountability:
Fudan University
Mitigation Strategies:
Fine-tuning on preference data to improve factuality and safety.
Input Output
Input Format:
Text input following conversational format.
Accepted Modalities:
text
Output Format:
Text output with capabilities to use plugins for specialized tasks.
Performance Tips:
Use INT-4/8 models for lower memory requirements; align text input for best context understanding.
Release Notes
Version:
moss-moon-003-base
Date:
N/A
Notes:
Base pre-trained model with 16B parameters, intended for conversational use.
Version:
moss-moon-003-sft
Date:
N/A
Notes:
Supervised fine-tuned version capable of handling multi-turn dialogues.
Version:
moss-moon-003-sft-plugin
Date:
N/A
Notes:
Plugin-augmented for enhanced functionality in specific tasks.
Note: green Score (e.g. "73.2") means that the model is better than fnlp/moss-moon-003-sft-plugin.
Rank the Moss Moon 003 Sft Plugin Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 54028 in total.