| Model Type | | text-to-text, decoder-only, large language model |
|
| Use Cases |
| Areas: | | Research, Commercial applications |
|
| Applications: | | Content Creation and Communication, Research and Education |
|
| Primary Use Cases: | | Text Generation, Chatbots and Conversational AI, Text Summarization |
|
| Limitations: | | Open-ended or highly complex tasks, Language ambiguity and nuance, Factual inaccuracy, Common sense reasoning |
|
|
| Additional Notes | | Trained using JAX and ML Pathways. |
|
| Supported Languages | |
| Training Details |
| Data Sources: | | Web Documents, Code, Mathematics |
|
| Data Volume: | |
| Hardware Used: | |
|
| Safety Evaluation |
| Methodologies: | | Red-teaming, Internal evaluations |
|
| Findings: | | Within acceptable thresholds for internal policies |
|
| Risk Categories: | | Text-to-Text Content Safety, Text-to-Text Representational Harms, Memorization, Large-scale harm |
|
|
| Responsible Ai Considerations |
| Fairness: | | Careful scrutiny and pre-processing of input data |
|
| Transparency: | | Model card summarizes the model's details |
|
| Accountability: | | Model creators accountable for evaluation processes |
|
| Mitigation Strategies: | | Continuous monitoring and exploration of de-biasing techniques |
|
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | | Generated English-language text |
|
| Performance Tips: | | Use bfloat16 or float16 for better performance on compatible hardware. |
|
|