| Model Type | | Decoder-only Transformer language model |
|
| Use Cases |
| Limitations: | | Exhibits biases present in its training data, Performance on tasks not in the evaluation suite may vary, Limited to training data cutoff date |
|
|
| Additional Notes | | The model has not undergone specific alignment or safety fine-tuning. |
|
| Supported Languages | |
| Training Details |
| Data Sources: | | DCLM-BASELINE, StarCoder, ProofPile2 |
|
| Data Volume: | |
| Context Length: | |
| Hardware Used: | |
| Model Architecture: | |
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | |
|