Model Type | auto-regressive language model |
|
Use Cases |
Areas: | research on large language models |
|
Primary Use Cases: | question answering, natural language understanding, reading comprehension |
|
Limitations: | generation of misinformation, generation of harmful, biased or offensive content |
|
Considerations: | The model is for research purposes and not recommended for use in downstream applications without further evaluations. |
|
|
Supported Languages | bg (unknown), ca (unknown), cs (unknown), da (unknown), de (unknown), en (unknown), es (unknown), fr (unknown), hr (unknown), hu (unknown), it (unknown), nl (unknown), pl (unknown), pt (unknown), ro (unknown), ru (unknown), sl (unknown), sr (unknown), sv (unknown), uk (unknown) |
|
Training Details |
Data Sources: | CCNet, C4, GitHub, Wikipedia, Books, ArXiv, Stack Exchange |
|
Methodology: | |
Training Time: | between Dec. 2022 and Feb. 2023 |
|
|
Responsible Ai Considerations |
Fairness: | Biases have been evaluated. |
|
Mitigation Strategies: | Filtered the data from the Web based on its proximity to Wikipedia. Used Kneser-Ney language model and fastText linear classifier. |
|
|