Foundational models for application-specific fine-tuning
Limitations:
The pre-training dataset may contain offensive or inappropriate content.
Exercise caution when deploying in production systems.
Do not use for applications that may cause harm or distress.
Additional Notes
These models have been superseded by later releases.
Supported Languages
en (fluent)
Training Details
Data Sources:
New experimental dataset based on The Pile
Data Volume:
Approximately 1.5 trillion tokens
Methodology:
Auto-regressive language model based on the NeoX transformer architecture
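For orientation, the sketch below shows how a NeoX-style base checkpoint is typically loaded and sampled with the Hugging Face transformers library. The repository id is a placeholder, not this model's published name.

```python
# Minimal sketch: loading a NeoX-style causal LM and generating a completion.
# The checkpoint id below is a hypothetical placeholder (an assumption).
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "org/neox-base-model"  # hypothetical checkpoint id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

# Base checkpoints are completion models: phrase the prompt as text to continue.
inputs = tokenizer("The Pile is a dataset that", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=True,
    temperature=0.7,
    pad_token_id=tokenizer.eos_token_id,  # NeoX tokenizers often lack a pad token
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Because this is a base (non-instruction-tuned) model, prompts work best as passages to continue rather than as chat-style instructions.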
Capabilities
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
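Since the card positions these checkpoints as foundations for application-specific fine-tuning, here is a minimal fine-tuning sketch using the transformers Trainer. The checkpoint id and the one-example dataset are assumptions for illustration only; a real run needs a task-specific corpus, a GPU, and tuned hyperparameters.

```python
# Minimal sketch: application-specific fine-tuning of the base model as a
# causal LM. Checkpoint id and training text are hypothetical placeholders.
from torch.utils.data import Dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

MODEL_ID = "org/neox-base-model"  # hypothetical checkpoint id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token  # NeoX tokenizers often lack a pad token
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

class CausalLMDataset(Dataset):
    """Tokenizes raw strings; labels mirror input_ids, with padding ignored."""
    def __init__(self, texts, tokenizer, max_length=128):
        self.encodings = [
            tokenizer(t, truncation=True, max_length=max_length,
                      padding="max_length", return_tensors="pt")
            for t in texts
        ]

    def __len__(self):
        return len(self.encodings)

    def __getitem__(self, i):
        enc = self.encodings[i]
        input_ids = enc["input_ids"].squeeze(0)
        attention_mask = enc["attention_mask"].squeeze(0)
        labels = input_ids.clone()
        labels[attention_mask == 0] = -100  # exclude padding from the loss
        return {"input_ids": input_ids,
                "attention_mask": attention_mask,
                "labels": labels}

train_dataset = CausalLMDataset(
    ["Replace with in-domain text for the target application."], tokenizer
)

args = TrainingArguments(
    output_dir="finetuned-model",
    per_device_train_batch_size=1,
    num_train_epochs=1,
    learning_rate=1e-5,
)
Trainer(model=model, args=args, train_dataset=train_dataset).train()
```

For larger models, parameter-efficient methods such as LoRA (via the peft library) are a common alternative to the full fine-tune sketched above.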