| Model Type | | text-generation, multimodal |
|
| Use Cases |
| Areas: | | Commercial applications, Research use |
|
| Applications: | | Assistant-like chat, Natural language generation, Multilingual dialogue |
|
| Primary Use Cases: | | Assistant-like chat, Text completion, Code generation |
|
| Limitations: | | Unsuitable for unsupported languages without additional fine-tuning |
|
| Considerations: | | Encourages developers to responsibly deploy and use safeguards. |
|
|
| Additional Notes | | Model was trained 2x faster using Unsloth and Huggingface's TRL library. |
|
| Supported Languages | | English (High), German (High), French (High), Italian (High), Portuguese (High), Hindi (High), Spanish (High), Thai (High) |
|
| Training Details |
| Data Sources: | | Publicly available online data |
|
| Methodology: | | Supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) |
|
| Context Length: | |
| Model Architecture: | | Auto-regressive language model using an optimized transformer architecture with Grouped-Query Attention (GQA) |
|
|
| Safety Evaluation |
| Methodologies: | | Red teaming, Adversarial testing |
|
| Findings: | | Potential vulnerabilities in multilingual capabilities, Inherent risks in capabilities such as coding and tool calls |
|
| Risk Categories: | | Misinformation, Bias, Cyber attack enablement |
|
| Ethical Considerations: | | Alignment with human preferences for safety through fine-tuning and reinforced learning with feedback. |
|
|
| Responsible Ai Considerations |
| Fairness: | | Model designed to serve a wide range of use cases and backgrounds. |
|
| Transparency: | | Openly shares guidelines and system safeguards. |
|
| Accountability: | | Developers are expected to ensure responsible deployment and system safeguards. |
|
| Mitigation Strategies: | | Employs high-quality data selection and safety tuning datasets. |
|
|
| Input Output |
| Input Format: | | ChatML or Alpaca templates for prompts. |
|
| Accepted Modalities: | |
| Output Format: | | Multilingual text and code outputs. |
|
| Performance Tips: | | Use recommended prompts and ensure system safeguards are in place. |
|
|
| Release Notes |
| Version: | |
| Date: | |
| Notes: | | Improved multilingual capabilities and released longer context window. |
|
|
|