BabyMistral is an open-source language model by OEvortex. Features: 1.6b LLM, VRAM: 0.9GB, Context: 8K, License: apache-2.0, Quantized, LLM Explorer Score: 0.15.
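The listed figures (1.6b parameters, 0.9GB VRAM, Quantized) can be related with a little arithmetic. The sketch below estimates the average storage per parameter. It assumes, without confirmation from the source, that the VRAM figure covers weights only and that "GB" means decimal gigabytes; the function name `bits_per_parameter` is illustrative.

```python
def bits_per_parameter(vram_gb: float, n_params: float) -> float:
    """Average bits stored per parameter.

    Assumption: the VRAM figure counts model weights only (no activations
    or KV cache) and is given in decimal gigabytes (1 GB = 1e9 bytes).
    """
    vram_bits = vram_gb * 1e9 * 8
    return vram_bits / n_params


# Figures taken from the listing: 0.9GB VRAM, 1.6b parameters.
bpp = bits_per_parameter(vram_gb=0.9, n_params=1.6e9)
print(f"{bpp:.2f} bits per parameter")
```

Under these assumptions the result is about 4.5 bits per parameter, which fits a roughly 4-bit quantized format with some overhead, rather than 16-bit weights.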
Use Cases:
Text completion and generation, Creative writing assistance, Dialogue systems, Question answering, Language understanding tasks
Limitations:
May struggle with very specialized or technical domains, Lacks real-time knowledge beyond its training data, Potential for generating plausible-sounding but incorrect information
Supported Languages
en (proficient)
Training Details
Data Sources:
1.5 trillion tokens
Data Volume:
1.5 trillion tokens
Methodology:
Trained from scratch
Training Time:
70 days
Hardware Used:
4x NVIDIA A100 GPUs
Model Architecture:
Based on Mistral
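The training figures above imply an average token throughput, which can be useful when sanity-checking the reported numbers. This is a minimal sketch, assuming a single uninterrupted pass over the 1.5 trillion tokens with all 4 GPUs busy for the full 70 days; the function name `training_throughput` is illustrative and not part of any library.

```python
SECONDS_PER_DAY = 86_400


def training_throughput(total_tokens: float, days: float, n_gpus: int):
    """Return (tokens/s overall, tokens/s per GPU).

    Assumption: training ran continuously for the whole period and the
    token count was processed exactly once.
    """
    seconds = days * SECONDS_PER_DAY
    total_tps = total_tokens / seconds
    return total_tps, total_tps / n_gpus


# Figures taken from the listing: 1.5 trillion tokens, 70 days, 4 GPUs.
total_tps, per_gpu_tps = training_throughput(1.5e12, 70, 4)
print(f"{total_tps:,.0f} tokens/s overall, {per_gpu_tps:,.0f} tokens/s per GPU")
```

If the implied per-GPU rate looks out of line with what the hardware can sustain for a model of this size, one of the listed figures (token count, duration, or GPU count) may be inaccurate.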
Responsible AI Considerations
Fairness:
The model may reproduce biases present in its training data.
Mitigation Strategies:
Generated content should be reviewed for accuracy and appropriateness.