| Model Type | | text generation, multilingual |
|
| Use Cases |
| Areas: | |
| Applications: | | assistant-like chat, natural language generation, synthetic data generation, distillation |
|
| Primary Use Cases: | | Instruction tuned text models for multilingual dialogues |
|
| Limitations: | | Prohibited in violation of laws or regulations, Non-supported languages without fine-tuning and controls |
|
| Considerations: | | Developers responsible for ensuring responsible use in non-supported languages. |
|
|
| Additional Notes | | Static model trained on an offline dataset. Future tuned versions will incorporate safety improvements via community feedback. |
|
| Supported Languages | | en (native), de (high), fr (high), it (high), pt (high), hi (high), es (high), th (high) |
|
| Training Details |
| Data Sources: | | A new mix of publicly available online data |
|
| Data Volume: | |
| Methodology: | |
| Context Length: | |
| Model Architecture: | | Auto-regressive, optimized transformer |
|
|
| Safety Evaluation |
| Methodologies: | | red teaming, adversarial tests |
|
| Risk Categories: | | CBRNE helpfulness, Child Safety, Cyber attack enablement |
|
| Ethical Considerations: | | Developers expected to use system safeguards to tailor safety for specific use cases. |
|
|
| Responsible Ai Considerations |
| Transparency: | | Developers responsible for integrating safeguards with third-party tools. |
|
| Mitigation Strategies: | | Included safety fine-tuning; emphasis on refusals and tone guidance. |
|
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | | Multilingual text and code |
|
|
| Release Notes |
| Version: | |
| Date: | |
| Notes: | | Initial launch with longer context window, multilingual support, and fine-tuning capabilities. |
|
|
|