| Model Type | |
| Use Cases |
| Areas: | | Research, Commercial applications |
|
| Applications: | | Content Creation, Communication, Chatbots, Conversational AI, Text Summarization, NLP Research, Language Learning Tools, Knowledge Exploration |
|
| Primary Use Cases: | |
| Limitations: | | Bias in training data, Context and task complexity, Language ambiguity and nuance, Factual inaccuracy, Lack of common sense |
|
| Considerations: | | Aware of potential biases and misuse. |
|
|
| Supported Languages | | English (high proficiency) |
|
| Training Details |
| Data Sources: | | Web Documents, Code, Mathematics |
|
| Data Volume: | | 8 trillion tokens for the 9B model |
|
| Hardware Used: | | Tensor Processing Unit (TPU) |
|
| Model Architecture: | | text-to-text, decoder-only large language model |
|
|
| Safety Evaluation |
| Methodologies: | | structured evaluations, internal red-teaming testing |
|
| Risk Categories: | | Text-to-Text Content Safety, Representational Harms, Memorization, Large-scale harm |
|
| Ethical Considerations: | | Met acceptable thresholds for safety. |
|
|
| Responsible Ai Considerations |
| Fairness: | | Efforts to address biases through curriculum and evaluation. |
|
| Transparency: | | Model card provides details on architecture, capabilities, limitations, and evaluation processes. |
|
| Accountability: | |
| Mitigation Strategies: | | Continuous monitoring, de-biasing techniques, guidelines for responsible use. |
|
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | | Generated English-language text |
|
| Performance Tips: | | Use appropriate prompts for improved context. |
|
|