Pythia 2.8B Deduped by EleutherAI


Tags: arXiv:2101.00027, arXiv:2201.07311, arXiv:2304.01373, autotrain-compatible, dataset:EleutherAI/the_pile_deduplicated, en, endpoints-compatible, gpt_neox, pythia, pytorch, region:us, safetensors

Pythia 2.8B Deduped (EleutherAI/pythia-2.8b-deduped)

Pythia 2.8B Deduped Parameters and Internals

Model Type: Transformer-based Language Model

Use Cases
  Areas: Research
  Applications: Interpretability research
  Primary Use Cases: Scientific experiments on language models; promoting interpretability research
  Limitations: Not suitable for deployment; not suitable for translation or for generating text in languages other than English
  Considerations: Please conduct your own risk and bias assessment when applying the model.

Additional Notes: All models in the Pythia suite were renamed in January 2023.

Training Details
  Data Sources: EleutherAI/the_pile_deduplicated
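
The training corpus above is published as a Hugging Face dataset and can be inspected directly. The snippet below is a minimal sketch rather than an official recipe: it assumes the datasets library is installed, that records expose a "text" field (the usual layout for Pile releases), and uses streaming to avoid downloading the full multi-hundred-GB dump.

from datasets import load_dataset

# Stream the deduplicated Pile instead of materialising it on disk.
pile = load_dataset("EleutherAI/the_pile_deduplicated", split="train", streaming=True)

# Peek at the first record (assumed "text" field).
first = next(iter(pile))
print(first["text"][:200])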
LLM Name: Pythia 2.8B Deduped
Repository: https://huggingface.co/EleutherAI/pythia-2.8b-deduped
Model Size: 2.8b
Required VRAM: 5.7 GB
Updated: 2025-07-30
Maintainer: EleutherAI
Model Type: gpt_neox
Model Files: 5.7 GB
Supported Languages: en
Model Architecture: GPTNeoXForCausalLM
License: apache-2.0
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.24.0
Tokenizer Class: GPTNeoXTokenizer
Vocabulary Size: 50304
Torch Data Type: float16
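
The specifications above map onto a standard Transformers loading call. The following is a minimal sketch under those assumptions: the model id and dtype come from this page, device_map="auto" additionally assumes the accelerate package is installed, and the prompt and sampling settings are arbitrary examples.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/pythia-2.8b-deduped"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # matches the listed Torch Data Type (~5.7 GB of weights)
    device_map="auto",          # requires accelerate; remove for plain CPU loading
)

# The context window is 2048 tokens, as listed above.
inputs = tokenizer("The Pile is a large, diverse corpus", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))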

Quantized Models of Pythia 2.8B Deduped

Model                       Likes   Downloads   VRAM
Pythia 2.8B Deduped GPTQ    0       9           1 GB
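
Recent Transformers releases (newer than the 4.24.0 listed above) can load GPTQ-quantized checkpoints through the same AutoModelForCausalLM entry point, provided the optimum and auto-gptq packages are installed. The repository id of the quantized variant is not listed on this page, so the placeholder below is hypothetical; treat this as a sketch, not a verified recipe.

from transformers import AutoModelForCausalLM, AutoTokenizer

gptq_repo_id = "<gptq-repo-id>"  # hypothetical placeholder: substitute the actual GPTQ repository

tokenizer = AutoTokenizer.from_pretrained(gptq_repo_id)
# Quantization settings are read from the repository's quantization config;
# the dequantization kernels expect a CUDA device.
model = AutoModelForCausalLM.from_pretrained(gptq_repo_id, device_map="auto")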

Best Alternatives to Pythia 2.8B Deduped

Best Alternatives                      Context / RAM    Downloads   Likes
Pythia 2.8B Thai Base V1               2K / 5.9 GB      43          2
Hh Full                                2K / 11.1 GB     5           0
... Llm Pythia 2.8B Pm Gen Ian Nd      2K / 11.1 GB     6           0
...thia 2.8B Mitchell Sft Hh Rlhf      2K / 11.1 GB     14          0
Pythia 2.8B Sft Hh Rlhf                2K / 5.6 GB      7           0
TLDR Pythia2.8B SFT                    2K / 11.1 GB     5           0
...ythia 2.8B Helpful Sft 3epochs      2K / 5.6 GB      14          0
Indication PYT V2                      2K / 5.7 GB      16          0
Indication PYT V2                      2K / 5.7 GB      12          0
...I Pythia 2.8B Deduped Sft Tldr      2K / 5.7 GB      56          0

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124