| Model Type | |
| Use Cases |
| Areas: | | Research, Coding, Mathematics, Multilingual Applications |
|
| Applications: | | Chatbots, Structured Data Understanding, Role-playing Implementations |
|
| Primary Use Cases: | | Instruction following, long-text generation, structured data understanding, structured output generation |
|
| Limitations: | | Not recommended for conversational use without further training |
|
| Considerations: | | For conversational use, first apply post-training methods such as SFT or RLHF |
|
|
| Additional Notes | | Supports multiple languages, with improved role-play conditions for chatbots. |
|
| Supported Languages | | English (high), Chinese (high), French (medium), Spanish (medium), Portuguese (medium), German (medium), Italian (medium), Russian (medium), Japanese (medium), Korean (medium), Vietnamese (medium), Thai (medium), Arabic (medium), other languages (basic) |
|
| Training Details |
| Data Sources: | | specialized expert models in coding and mathematics |
|
| Context Length: | |
| Model Architecture: | | Transformer with RoPE, SwiGLU, RMSNorm, and attention QKV bias |
|
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | |
| Performance Tips: | | Use the latest version of 'transformers' to avoid errors. |
|
|
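
The performance tip above recommends a recent 'transformers' release. A minimal sketch of how one might check the installed version before loading the model is shown below. The minimum version used here (`MIN_VERSION`) is a hypothetical placeholder, since the table does not state one, and the helper names `parse_version` and `is_at_least` are illustrative only.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text):
    """Turn '4.40.1' or '4.41.0.dev0' into a tuple of its leading integers."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component, e.g. 'dev0'
        parts.append(int(digits))
    return tuple(parts)


def is_at_least(installed, minimum):
    """Return True if the installed version is >= the minimum version."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '4.37' and '4.37.0' compare as equal.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


# Hypothetical placeholder: the table above does not state a minimum version.
MIN_VERSION = "4.37.0"

try:
    installed = version("transformers")
except PackageNotFoundError:
    installed = None

if installed is None:
    print("transformers is not installed; try: pip install -U transformers")
elif is_at_least(installed, MIN_VERSION):
    print(f"transformers {installed} meets the assumed minimum {MIN_VERSION}")
else:
    print(f"transformers {installed} is older than {MIN_VERSION}; try: pip install -U transformers")
```

Comparing parsed integer tuples rather than raw strings avoids ordering mistakes such as treating "4.100.0" as older than "4.37.0".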