Aira 2 Portuguese 560M is an open-source language model by nicholasKluge. Features: 560m LLM, License: bigscience-bloom-rail-1.0, Instruction-Based, LLM Explorer Score: 0.08.
Hallucinations, Biases and toxicity, Repetition and verbosity
Additional Notes
The model aims to generate accurate in-context responses; however, care should be taken regarding its well-documented limitations.
Supported Languages
Portuguese (Native with all dialects)
Training Details
Data Sources:
nicholasKluge/instruct-aira-dataset
Methodology:
The model was trained with a dataset composed of prompts and completions generated synthetically by prompting already-tuned models (ChatGPT, Llama, Open-Assistant, etc).
Hardware Used:
1 NVIDIA A100-SXM4-40GB
Model Architecture:
Based on BLOOM
Input Output
Input Format:
String with special tokens
Accepted Modalities:
Text
Output Format:
Text responses
Performance Tips:
Ensure repetition penalty, temperature, top_k, and top_p parameters are set to prevent repetitive or verbose outputs.
Release Notes
Version:
1.0
Date:
2023
Notes:
Initial release with instruction-tuned capabilities and text generation.
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 51634 in total.