Version 1.3 of this model has known issues; a new release, version 1.4, is anticipated.
Training Details
Data Sources:
Synthetic data generated with OpenAI GPT-4 via the Airoboros framework.
Methodology:
Fine-tuned with a fork of QLoRA, using a slightly modified Vicuna prompt template. The training pipeline pairs synthetic data generation (via Airoboros) with QLoRA fine-tuning; a minimal sketch appears after this list.
Context Length:
2048 tokens (the standard LLaMA context window).
Model Architecture:
A 65-billion-parameter LLaMA model, fine-tuned efficiently with QLoRA.
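For illustration, here is a minimal QLoRA fine-tuning sketch in Python, assuming the Hugging Face transformers, peft, and bitsandbytes libraries; the base checkpoint name and all hyperparameters are placeholders, not the author's exact training script.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base_model = "huggyllama/llama-65b"  # assumed base; any LLaMA-65B checkpoint works

# Load the frozen base model in 4-bit NF4 precision, the core idea of QLoRA.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    base_model, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Attach small low-rank adapters to the attention projections; only these
# adapter weights are trained while the 4-bit base stays frozen.
model = prepare_model_for_kbit_training(model)
lora_config = LoraConfig(
    r=64,                # illustrative rank, not the author's setting
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # adapters are a tiny fraction of 65B params

Training then proceeds with a standard causal-language-modeling loop (or the Trainer API) over the synthetic instruction data.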
Input / Output
Input Format:
A chat format in which the user's message follows 'USER: ' and the model's response follows 'ASSISTANT: '.
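As an illustration, a minimal Python sketch of assembling a single-turn prompt in this format follows; the exact wording of the system preamble is an assumption, and only the 'USER: ' and 'ASSISTANT: ' markers come from the description above.

# Build a single-turn prompt in the modified Vicuna chat format.
def build_prompt(user_input: str) -> str:
    # The system preamble wording here is hypothetical, not from the model card.
    system = (
        "A chat between a curious user and an assistant. "
        "The assistant gives helpful, detailed responses to the user's input."
    )
    return f"{system} USER: {user_input} ASSISTANT: "

print(build_prompt("Explain QLoRA in one sentence."))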