| Model Type | |
| Use Cases |
| Areas: | | research, commercial applications |
|
| Applications: | | chatbots, text generation tasks |
|
| Primary Use Cases: | |
| Limitations: | | Performance in languages other than French and English is not guaranteed., Degradation in performance due to change in datatype from float16 to bfloat16. |
|
|
| Additional Notes | | The tokenizer designed for multilingual contexts improves efficiency. |
|
| Supported Languages | | French (fluent), English (fluent) |
|
| Training Details |
| Data Sources: | | ehartford/wizard_vicuna_70k_unfiltered, shahules786/orca-chat, timdettmers/openassistant-guanaco, laion/OIG |
|
| Data Volume: | |
| Methodology: | | Fine-tuned on French and English data |
|
| Context Length: | |
| Hardware Used: | | 1 x A100 40GB, 4 x A100 40GB |
|
| Model Architecture: | | Transposition from float16 to bfloat16 for improved efficiency |
|
|
| Input Output |
| Input Format: | |
| Accepted Modalities: | |
| Output Format: | | Text response with chatbot capabilities |
|
| Performance Tips: | | Precede individual prompt by EOS token (</s>) and generated part by BOS token (<s>). |
|
|
| Release Notes |
| Version: | |
| Date: | |
| Notes: | | Fine-tuned model for chatbot applications in French and English. |
|
|
|