| Model Type | |
| Use Cases |
| Areas: | | research, commercial applications |
|
|
| Additional Notes | | Model includes a comprehensive 150,000-token vocabulary optimized for multi-languages. |
|
| Supported Languages | | zh (high), en (high), multilingual (medium) |
|
| Training Details |
| Data Sources: | |
| Data Volume: | |
| Context Length: | |
| Model Architecture: | |
|
| Input Output |
| Input Format: | | Standard transformer inputs |
|
| Accepted Modalities: | |
| Output Format: | |
| Performance Tips: | | Installation of optional flash-attention for increased efficiency and reduced memory. |
|
|