| Model Type | | text generation, instruction following |
|
| Use Cases |
| Areas: | | general instructions, AI assistants, business applications |
|
| Applications: | | text generation, instruction following |
|
| Primary Use Cases: | | Summarization, Text classification, Text extraction, Question-answering, Retrieval Augmented Generation (RAG), Code related tasks, Function-calling tasks, Multilingual dialog use cases |
|
| Limitations: | | Might not perform equally across all languages as in English., Potential for inaccurate, biased, or unsafe responses without proper safety testing. |
|
| Considerations: | | Proper safety testing and example tuning tailored for specific tasks. |
|
|
| Additional Notes | | The model infrastructure is environmentally friendly, leveraging 100% renewable energy. |
|
| Supported Languages | | English (supported), German (supported), Spanish (supported), French (supported), Japanese (supported), Portuguese (supported), Arabic (supported), Czech (supported), Italian (supported), Korean (supported), Dutch (supported), Chinese (supported) |
|
| Training Details |
| Data Sources: | | publicly available datasets with permissive license, internal synthetic data, human-curated data |
|
| Methodology: | | supervised finetuning, model alignment using reinforcement learning, and model merging |
|
| Context Length: | |
| Hardware Used: | | IBM's supercomputing cluster, Blue Vela with NVIDIA H100 GPUs |
|
| Model Architecture: | | decoder-only sparse Mixture of Experts (MoE) transformer architecture |
|
|
| Responsible Ai Considerations |
| Fairness: | | multilingual data, but primary tuning on English instruction-response pairs. |
|
| Transparency: | | Model developed by Granite Team, IBM. See accompanying technical documentation. |
|
| Mitigation Strategies: | | Introducing few-shot learning for improved accuracy on multilingual tasks. |
|
|
| Input Output |
| Input Format: | | chat template with role, content fields |
|
| Accepted Modalities: | |
| Output Format: | |
| Performance Tips: | | Adjust sequence length as required. |
|
|
| Release Notes |
| Date: | |
| Notes: | | Initial release with instruction tuning and multilingual capabilities. |
|
|
|