| Model Type | |
| Use Cases |
| Areas: | | research, foundation for specialization |
|
| Applications: | | summarization, text generation, chatbot |
|
| Limitations: | | poor generalization to unsupported languages, biases inherited from web corpora |
|
| Considerations: | | Tailor the model to specific tasks and evaluate it accordingly. |
|
|
| Additional Notes | | This model was produced by a mergekit merge with specific layer configurations. |
|
| Supported Languages | | Portuguese (native), English (proficient), German (proficient), Spanish (proficient), French (proficient), Italian (proficient), Polish (proficient), Dutch (proficient), Romanian (proficient), Czech (proficient), Swedish (proficient) |
|
| Training Details |
| Data Sources: | | wikimedia/wikipedia Portuguese subset |
|
| Methodology: | | continued pre-training, plus pruning guided by layer similarity to reduce model size while maintaining performance |
|
|
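
The Methodology entry mentions pruning guided by layer similarity. The following is a minimal sketch of that idea, not the exact PruneMe implementation: it assumes that similarity is measured as the angular distance between the hidden states entering a block of consecutive layers and those leaving it, and that the block with the smallest distance is the best candidate for removal. The function names and the `hidden_states` layout are hypothetical.

```python
import numpy as np


def angular_distance(a, b):
    """Mean angular distance in [0, 1] between matching rows of a and b.

    Assumes every row has a non-zero norm.
    """
    cos = np.sum(a * b, axis=-1) / (
        np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1)
    )
    # Clip to guard against values slightly outside [-1, 1] from rounding.
    return float(np.mean(np.arccos(np.clip(cos, -1.0, 1.0)) / np.pi))


def most_redundant_block(hidden_states, block_size):
    """Find the block of consecutive layers that changes its input the least.

    hidden_states[i] is an array of shape (num_samples, hidden_dim) holding
    the input to layer i; the last entry is the output of the final layer.
    Returns (start_layer, distance) for the block with the smallest distance.
    """
    num_layers = len(hidden_states) - 1
    if not 0 < block_size <= num_layers:
        raise ValueError("block_size must be between 1 and the layer count")
    best_start, best_dist = 0, float("inf")
    for start in range(num_layers - block_size + 1):
        dist = angular_distance(
            hidden_states[start], hidden_states[start + block_size]
        )
        if dist < best_dist:
            best_start, best_dist = start, dist
    return best_start, best_dist
```

In practice the hidden states would come from running the model over a calibration dataset; here they are simply passed in as arrays so the selection logic stays independent of any particular framework.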
| Safety Evaluation |
| Methodologies: | | PruneMe layer similarity analysis |
|
| Risk Categories: | | bias, generalization issues |
|
|
| Responsible AI Considerations |
| Mitigation Strategies: | | Fine-tuning and guardrails recommended for production use. |
|
|
| Input / Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | |
| Performance Tips: | | Use PyTorch 2.0 for optimal inference with Falcon models |
|
|
| Release Notes |
| Notes: | | Merged using the passthrough method and pruned with PruneMe for optimized performance. |
|
|
|
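
The notes above mention a mergekit passthrough merge with specific layer configurations. As an illustration only, the sketch below builds a passthrough configuration that keeps every layer except one pruned block. The layer counts, the model name `example-org/base-model`, and both helper functions are assumptions, since the source does not give the actual layer ranges. The `slices`, `sources`, `layer_range`, `merge_method`, and `dtype` keys follow mergekit's YAML configuration format.

```python
def kept_layer_ranges(num_layers, prune_start, prune_size):
    """Half-open [start, end) layer ranges left after dropping one block."""
    prune_end = prune_start + prune_size
    if prune_start < 0 or prune_size <= 0 or prune_end > num_layers:
        raise ValueError("pruned block must lie inside the model")
    ranges = []
    if prune_start > 0:
        ranges.append((0, prune_start))
    if prune_end < num_layers:
        ranges.append((prune_end, num_layers))
    return ranges


def passthrough_config(model, ranges, dtype="bfloat16"):
    """Render a mergekit-style passthrough config as YAML text."""
    lines = ["slices:"]
    for start, end in ranges:
        lines += [
            "  - sources:",
            f"      - model: {model}",
            f"        layer_range: [{start}, {end}]",
        ]
    lines += ["merge_method: passthrough", f"dtype: {dtype}"]
    return "\n".join(lines) + "\n"


# Hypothetical example: a 32-layer model with layers 22..29 removed.
example_config = passthrough_config(
    "example-org/base-model", kept_layer_ranges(32, 22, 8)
)
```

The resulting text could be written to a file and passed to mergekit's `mergekit-yaml` command. The values shown are placeholders rather than the configuration used for this model.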