OpenELM 450M Instruct is an open-source language model by apple. Features: 450m LLM, VRAM: 0.9GB, License: apple-amlr, Instruction-Based, LLM Explorer Score: 0.15.
Users should conduct thorough safety testing and implement appropriate filtering.
Additional Notes
Supports pre-trained and instruction-tuned models with sizes varying from 270M to 3B parameters. Package includes data prep, training, fine-tuning, evaluation, checkpoints, and logs.
Training Details
Data Sources:
RefinedWeb, deduplicated PILE, subset of RedPajama, subset of Dolma v1.6
Data Volume:
~1.8 trillion tokens
Methodology:
layer-wise scaling strategy within transformer layers
Input Output
Input Format:
Plain text
Accepted Modalities:
text
Output Format:
Generated text
Performance Tips:
Use appropriate batch sizes and token speculation for faster generation.
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.