| Model Type | |
| Use Cases | |
| Additional Notes | | OLMo is a series of open language models. |
|
| Supported Languages | |
| Training Details |
| Data Sources: | | allenai/tulu-3-sft-olmo-2-mixture, allenai/olmo-2-1124-13b-preference-mix, allenai/RLVR-GSM-MATH-IF-Mixed-Constraints |
|
| Methodology: | | supervised finetuning on TΓΌlu 3 dataset, DPO training, RLVR training |
|
|
| Safety Evaluation |
| Risk Categories: | |
| Ethical Considerations: | | Limited safety training, potential for problematic outputs. |
|
|
| Input Output |
| Accepted Modalities: | |
| Output Format: | |
|
| Release Notes |
| Date: | |
| Notes: | | Post-trained variant with RLVR training. |
|
|
|