| Model Type | reward_model, evaluation, reranking, instruction |
|
| Use Cases |
| Areas | research, commercial applications |
| Applications | LLM evaluation, decoding enhancement, instruction alignment |
| Primary Use Cases | ranking output candidates, enhancing decoding processes, aligning models with RLHF methods |
|
|
| Supported Languages | |
| Training Details |
| Data Sources | openai/summarize_from_feedback, openai/webgpt_comparisons, Dahoas/synthetic-instruct-gptj-pairwise, Anthropic/hh-rlhf, lmsys/chatbot_arena_conversations, openbmb/UltraFeedback |
| Methodology | Pairwise comparison approach with bidirectional attention |
| Context Length | |
| Hardware Used | |
| Model Architecture | Pairwise comparison through bidirectional attention |
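The data sources listed above are preference datasets, which supply a prompt together with a preferred and a dispreferred response. A minimal sketch of turning such records into the (instruction, preferred, dispreferred) tuples a pairwise comparison model trains on — assuming a simplified dict layout with `prompt`, `chosen`, and `rejected` fields, since each listed dataset has its own schema:

```python
# Sketch: building pairwise training tuples from preference data.
# The record layout ("prompt", "chosen", "rejected") is an assumed
# simplification; the actual field names vary per dataset.

def to_pairwise_examples(records):
    """Yield (instruction, preferred, dispreferred) tuples,
    the training format for a pairwise comparison model."""
    for rec in records:
        yield (rec["prompt"], rec["chosen"], rec["rejected"])

# Toy record for illustration only.
toy_data = [
    {
        "prompt": "Explain photosynthesis.",
        "chosen": "Plants convert light into chemical energy...",
        "rejected": "It is a thing plants do.",
    },
]
pairs = list(to_pairwise_examples(toy_data))
```

During training, the model sees both responses for the same instruction and learns to score the chosen one higher, which is what later enables reranking and RLHF-style alignment.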
|
|
| Input Output |
| Input Format | Instruction and a pair of output candidates |
| Accepted Modalities | |
| Output Format | |
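Since the model takes an instruction plus a pair of output candidates, ranking more than two candidates requires composing pairwise comparisons. The sketch below shows one common scheme, round-robin scoring; the `compare` function is a hypothetical stand-in (a toy length heuristic) for the model's actual forward pass, which would score the pair jointly with bidirectional attention:

```python
# Sketch: ranking N candidates with a pairwise comparator.
from itertools import combinations

def compare(instruction: str, a: str, b: str) -> float:
    """Hypothetical stub: positive if `a` is preferred over `b`.
    A real pairwise reward model would return a learned
    preference logit for the (instruction, a, b) triple."""
    # Toy heuristic for illustration only: prefer the longer answer.
    return float(len(a) - len(b))

def rank_candidates(instruction: str, candidates: list[str]) -> list[str]:
    """Round-robin pairwise ranking: each candidate's score is its
    total preference margin against every other candidate."""
    scores = {c: 0.0 for c in candidates}
    for a, b in combinations(candidates, 2):
        margin = compare(instruction, a, b)
        scores[a] += margin
        scores[b] -= margin
    return sorted(candidates, key=lambda c: scores[c], reverse=True)

ranked = rank_candidates(
    "Summarize the article.",
    ["Short.", "A medium-length summary.", "A longer, more detailed summary."],
)
best = ranked[0]  # top-ranked candidate under the toy comparator
```

Round-robin uses O(N²) comparisons; for larger candidate sets, tournament-style or sampled comparisons are cheaper trade-offs.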
|