DialoGPT Medium by microsoft

 ยป  All LLMs  ยป  microsoft  ยป  DialoGPT Medium   URL Share it on

  Arxiv:1911.00536   Autotrain compatible   Conversational   Endpoints compatible   Gpt2   Jax   Pytorch   Region:us   Rust   Tf

DialoGPT Medium Benchmarks

DialoGPT Medium (microsoft/DialoGPT-medium)
๐ŸŒŸ Advertise your project ๐Ÿš€

DialoGPT Medium Parameters and Internals

Model Type 
Conversational, Dialogue Response Generation
Use Cases 
Areas:
Research, Commercial applications
Applications:
Chatbots, Virtual assistants
Primary Use Cases:
Dialogue generation
Additional Notes 
Human evaluation results indicate high response quality.
Supported Languages 
English (High proficiency)
Training Details 
Data Sources:
Reddit discussion thread
Data Volume:
147M multi-turn dialogues
Methodology:
Pretraining on large-scale dialogue data
Input Output 
Input Format:
User input encoded with tokenizer
Accepted Modalities:
Text
Output Format:
Generated dialogue response
LLM NameDialoGPT Medium
Repository ๐Ÿค—https://huggingface.co/microsoft/DialoGPT-medium 
Required VRAM0.9 GB
Updated2025-08-28
Maintainermicrosoft
Model Typegpt2
Model Files  0.9 GB
Model ArchitectureGPT2LMHeadModel
Licensemit
Model Max Length1024
Tokenizer ClassGPT2Tokenizer
Vocabulary Size50257
Activation Functiongelu_new
Errorsreplace

Best Alternatives to DialoGPT Medium

Best Alternatives
Context / RAM
Downloads
Likes
Gpt2 Base Bne0K / 0.5 GB17360
Chatbench Distilgpt20K / 0.3 GB322
BrtGPT 1 Pre0K / 0.4 GB1320
GPT 2 Large 115K Steps0K / 0 GB50
GPT 2 Large 51K Steps0K / 0 GB70
GPT 2 Large 32K Steps0K / 0 GB80
GPT 2 Large 40K Steps0K / 0 GB50
GPT 2 Large 20K Steps0K / 0 GB80
GPT 2 Large 43K Steps0K / 0 GB90
Distilgpt2 Openvino0K / 0.3 GB70
Note: green Score (e.g. "73.2") means that the model is better than microsoft/DialoGPT-medium.

Rank the DialoGPT Medium Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50968 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124