OpenBezoar HH RLHF SFT by SurgeGlobal

 »  All LLMs  »  SurgeGlobal  »  OpenBezoar HH RLHF SFT   URL Share it on

OpenBezoar HH RLHF SFT is an open-source language model by SurgeGlobal. Features: 3b LLM, VRAM: 6.8GB, Context: 2K, License: cc-by-nc-4.0, LLM Explorer Score: 0.09.

  Arxiv:2306.02707   Arxiv:2404.12195 Base model:finetune:surgegloba... Base model:surgeglobal/openbez...   Dataset:anthropic/hh-rlhf   En   Endpoints compatible   Llama   Pytorch   Region:us   Safetensors

OpenBezoar HH RLHF SFT Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

OpenBezoar HH RLHF SFT Parameters and Internals

Model Type 
text generation
Use Cases 
Limitations:
The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops., This model is not aligned to human preferences and therefore it may generate harmful and uncensored content., Caution is urged against relying on this model for production or adjacent use-cases.
Supported Languages 
en ()
Training Details 
Data Sources:
Anthropic HH-RLHF Dataset
Data Volume:
First 100K examples
Methodology:
Supervised Fine-Tuning (SFT)
Model Architecture:
OpenLLaMA 3B v2
Input Output 
Input Format:
Alpaca prompt template
Performance Tips:
It is important to utilize the Alpaca prompt template in order to obtain best responses for instruction related tasks.
LLM NameOpenBezoar HH RLHF SFT
Repository 🤗https://huggingface.co/SurgeGlobal/OpenBezoar-HH-RLHF-SFT 
Base Model(s)  OpenBezoar SFT   SurgeGlobal/OpenBezoar-SFT
Model Size3b
Required VRAM6.8 GB
Updated2026-05-16
MaintainerSurgeGlobal
Model Typellama
Model Files  6.8 GB   6.8 GB
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licensecc-by-nc-4.0
Context Length2048
Model Max Length2048
Transformers Version4.33.2
Tokenizer ClassLlamaTokenizer
Vocabulary Size32000
Torch Data Typefloat16

Best Alternatives to OpenBezoar HH RLHF SFT

Best Alternatives
Context / RAM
Downloads
Likes
Nanbeige4.1 3B256K / 7.9 GB1728321108
Nanbeige4.1 3B Heretic256K / 7.9 GB39350
Nanbeige4.1 3B Bf16256K / 7.9 GB713
Nanbeige4.1 3B Heretic Mxfp4256K / 7.9 GB302
Nanbeige4.1 3B Heretic256K / 7.9 GB281
Nanbeige4.1 3B Heretic256K / 7.9 GB60
ISA 03 Mini 3B Hybrid Preview256K / 6.5 GB8514
Llama 3.2 3B Instruct128K / 6.5 GB22052122139
Llama 3.2 3B128K / 6.5 GB1183471785
Llama 3.2 3B Instruct128K / 6.5 GB12989
Note: green Score (e.g. "73.2") means that the model is better than SurgeGlobal/OpenBezoar-HH-RLHF-SFT.

Rank the OpenBezoar HH RLHF SFT Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53972 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a