| Model Type | | text generation, multilingual | 
 | 
| Use Cases | 
| Areas: |  |  | Applications: | | assistant-like chat, natural language generation, synthetic data generation, distillation | 
 |  | Primary Use Cases: | | Instruction tuned text models for multilingual dialogues | 
 |  | Limitations: | | Prohibited in violation of laws or regulations, Non-supported languages without fine-tuning and controls | 
 |  | Considerations: | | Developers responsible for ensuring responsible use in non-supported languages. | 
 |  | 
| Additional Notes | | Static model trained on an offline dataset. Future tuned versions will incorporate safety improvements via community feedback. | 
 | 
| Supported Languages | | en (native), de (high), fr (high), it (high), pt (high), hi (high), es (high), th (high) | 
 | 
| Training Details | 
| Data Sources: | | A new mix of publicly available online data | 
 |  | Data Volume: |  |  | Methodology: |  |  | Context Length: |  |  | Model Architecture: | | Auto-regressive, optimized transformer | 
 |  | 
| Safety Evaluation | 
| Methodologies: | | red teaming, adversarial tests | 
 |  | Risk Categories: | | CBRNE helpfulness, Child Safety, Cyber attack enablement | 
 |  | Ethical Considerations: | | Developers expected to use system safeguards to tailor safety for specific use cases. | 
 |  | 
| Responsible Ai Considerations | 
| Transparency: | | Developers responsible for integrating safeguards with third-party tools. | 
 |  | Mitigation Strategies: | | Included safety fine-tuning; emphasis on refusals and tone guidance. | 
 |  | 
| Input Output | 
| Input Format: |  |  | Accepted Modalities: |  |  | Output Format: | | Multilingual text and code | 
 |  | 
| Release Notes | | 
| Version: |  |  | Date: |  |  | Notes: | | Initial launch with longer context window, multilingual support, and fine-tuning capabilities. | 
 |  | 
 |