| Model Type | |
| Use Cases |
| Areas: | | research, commercial applications |
|
| Primary Use Cases: | | Q&A systems, text generation |
|
| Limitations: | | Potential for generating biased or harmful content |
|
| Considerations: | | Users should ensure outputs comply with ethical and legal standards |
|
|
| Additional Notes | | This model emphasizes safety but acknowledges potential limitations due to probabilistic outputs. |
|
| Supported Languages | | en (excellent), zh (excellent) |
|
| Training Details |
| Data Sources: | | internet corpus, books, code |
|
| Data Volume: | | 2.5 trillion tokens for pre-training |
|
| Methodology: | | Human-aligned training, YaRN interpolation method for position encoding |
|
| Context Length: | |
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | |
|