OpenBezoar HH RLHF SFT is an open-source language model by SurgeGlobal. Features: 3b LLM, VRAM: 6.8GB, Context: 2K, License: cc-by-nc-4.0, LLM Explorer Score: 0.09.
The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops., This model is not aligned to human preferences and therefore it may generate harmful and uncensored content., Caution is urged against relying on this model for production or adjacent use-cases.
Supported Languages
en ()
Training Details
Data Sources:
Anthropic HH-RLHF Dataset
Data Volume:
First 100K examples
Methodology:
Supervised Fine-Tuning (SFT)
Model Architecture:
OpenLLaMA 3B v2
Input Output
Input Format:
Alpaca prompt template
Performance Tips:
It is important to utilize the Alpaca prompt template in order to obtain best responses for instruction related tasks.
Note: green Score (e.g. "73.2") means that the model is better than SurgeGlobal/OpenBezoar-HH-RLHF-SFT.
Rank the OpenBezoar HH RLHF SFT Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.