Model Type | text-generation, code generation |
|
Use Cases |
Areas: | Research, Code generation |
|
Applications: | |
Primary Use Cases: | Programming assistance, Language modeling |
|
Limitations: | The model may perform poorly on non-English text and can carry stereotypes and biases. Generated code may contain errors, inefficiencies, or vulnerabilities. |
|
Considerations: | Generated code that reproduces material from the training dataset may require attribution. |
|
|
Additional Notes | StarChat, the instruction-tuned version, makes the model a capable assistant. |
|
Supported Languages | English (native), Programming languages (80+) |
|
Training Details |
Data Sources: | RefinedWeb, StarCoderData, The Stack (v1.2), Wikipedia |
|
Data Volume: | |
Methodology: | Fill-in-the-Middle objective |
|
Context Length: | |
Training Time: | |
Hardware Used: | |
Model Architecture: | GPT-2 model with multi-query attention |
|
|
Responsible AI Considerations |
Fairness: | The model carries the stereotypes and biases commonly encountered online, given its training data. |
|
Mitigation Strategies: | The code dataset was filtered for permissive licenses only. |
|
|
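
The Fill-in-the-Middle objective listed under Training Details can be illustrated with a minimal sketch. It assumes the prefix-suffix-middle ordering and the sentinel strings `<fim_prefix>`, `<fim_suffix>`, and `<fim_middle>`, which are not stated in this card; check the tokenizer of the model you use before relying on them. The function names are hypothetical.

```python
import random

# Assumed sentinel strings; the card does not list them, so verify them
# against the tokenizer of the model you actually use.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def to_fim_example(document: str, rng: random.Random) -> str:
    """Rearrange a document into prefix-suffix-middle order for training."""
    # Two random cut points split the text into prefix, middle, and suffix.
    a, b = sorted(rng.randint(0, len(document)) for _ in range(2))
    prefix, middle, suffix = document[:a], document[a:b], document[b:]
    # Trained left to right on this string, the model learns to produce
    # the middle after it has seen both the prefix and the suffix.
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}{middle}"


def build_infill_prompt(prefix: str, suffix: str) -> str:
    """Build an inference prompt; the model continues after the last sentinel."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"
```

At inference time, `build_infill_prompt("def add(a, b):\n    return ", "\n")` asks the model to generate the missing expression between the two fragments.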