| Model Type | |
| Use Cases |
| Areas: | | Research, Commercial applications |
|
| Applications: | | Assistant-like chat, Natural language generation tasks, Synthetic data generation and distillation |
|
| Primary Use Cases: | | Multilingual dialogue, Commercial and research use |
|
| Limitations: | | Use in unsupported languages is discouraged. |
|
| Considerations: | | Ensure responsible and safe use in alignment with best practices. |
|
|
| Additional Notes | | Note the special conditions for multilingual support beyond the initially tested languages. |
|
| Supported Languages | | English (High proficiency), German (High proficiency), French (High proficiency), Italian (High proficiency), Portuguese (High proficiency), Hindi (High proficiency), Spanish (High proficiency), Thai (High proficiency) |
|
| Training Details |
| Data Sources: | | A new mix of publicly available online data |
|
| Data Volume: | |
| Methodology: | | Supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). |
|
| Context Length: | |
| Training Time: | | 2x faster with Unsloth and Huggingface's TRL library |
|
| Hardware Used: | | Unsloth tool and Hugging Face TRL library |
|
| Model Architecture: | | Auto-regressive language model using an optimized transformer architecture. |
|
|
| Safety Evaluation |
| Methodologies: | | Red teaming, Adversarial testing |
|
| Risk Categories: | | Misinformation, Bias, CBRNE (Chemical, Biological, Radiological, Nuclear, and Explosive materials), Child Safety, Cyber attack enablement |
|
| Ethical Considerations: | | Model may produce inaccurate, biased or objectionable responses. Safety testing and tuning are required before deployment. |
|
|
| Responsible Ai Considerations |
| Fairness: | | Model is intended to serve a diverse range of users without normative constraints. |
|
| Transparency: | | Guidelines for ethical and safe deployment are available. |
|
| Accountability: | | Developers are responsible for evaluating their specific applications. |
|
| Mitigation Strategies: | | Includes safety tuning, adversarial testing, and compliance with a Responsible Use Guide. |
|
|
| Input Output |
| Input Format: | | ChatML and Alpaca prompts |
|
| Accepted Modalities: | |
| Output Format: | | Multilingual text and code |
|
| Performance Tips: | | Use dedicated prompt templates for better performance. |
|
|
| Release Notes |
| Version: | |
| Date: | |
| Notes: | | Released with custom commercial license. Community feedback to improve model safety. |
|
|
|