Deployments, including those in additional languages, must adhere to safety guidelines and responsible use principles.
Additional Notes
This is a static model trained on an offline dataset. Future versions may be released that improve model capabilities and safety.
Supported Languages
English (official), German (official), French (official), Italian (official), Portuguese (official), Hindi (official), Spanish (official), Thai (official)
Training Details
Data Sources:
A new mix of publicly available online data
Data Volume:
Up to 9 trillion tokens
Context Length:
128000
Hardware Used:
Meta's custom built GPU cluster
Model Architecture:
Auto-regressive with an optimized transformer architecture
Responsible Ai Considerations
Mitigation Strategies:
Llama 3.2 was developed following best practices outlined in Meta's Responsible Use Guide.
Note: green Score (e.g. "73.2") means that the model is better than meta-llama/Llama-3.2-1B.
Rank the Llama 3.2 1B Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.