Pythia 410M Deduped by EleutherAI: Benchmarks, Features and Detailed Analysis

Tags: arXiv:2101.00027, arXiv:2201.07311, arXiv:2304.01373, Dataset: eleutherai/the pile de..., Deploy: azure, En, Endpoints compatible, Gpt neox, Pythia, Pytorch, Region: us, Safetensors

Model Card on HF 🤗: https://huggingface.co/EleutherAI/pythia-410m-deduped

Pythia 410M Deduped Benchmarks

Benchmark	Score	Reference score	Reference model	Difference
ARC	24.83	96.7	so35	-74.3%
HellaSwag	41.29	95.3	gpt4	-56.7%
MMLU	25.99	88.3	so35	-70.6%
TruthfulQA	40.95	59	gpt4	-30.6%
WinoGrande	54.38	87.5	gpt4	-37.9%
GSM8K	0.3	96.4	so35	-99.7%

LLME Score: 0.15353

Difference: how the model compares to the reference model, which is one of Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
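The percentage figures appear to be the relative difference between the model's score and the reference model's score. The page does not state the formula, so the sketch below is an assumption inferred from the listed numbers; `relative_gap` is a hypothetical helper name.

```python
def relative_gap(score, reference):
    """Percentage difference of `score` relative to `reference`, to one decimal."""
    return round((score - reference) / reference * 100, 1)

# (model score, reference score) pairs copied from the benchmark list
benchmarks = {
    "ARC": (24.83, 96.7),
    "HellaSwag": (41.29, 95.3),
    "MMLU": (25.99, 88.3),
    "TruthfulQA": (40.95, 59),
    "WinoGrande": (54.38, 87.5),
    "GSM8K": (0.3, 96.4),
}

gaps = {name: relative_gap(s, r) for name, (s, r) in benchmarks.items()}

for name, gap in gaps.items():
    print(f"{name}: {gap}%")
```

Under this assumption the computed values match the six percentages listed for the model.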


Pythia 410M Deduped (EleutherAI/pythia-410m-deduped)


Pythia 410M Deduped Parameters and Internals

Model Type

Transformer-based Language Model, Causal Language Modeling

Use Cases

Areas:

Research, Scientific Experiments

Applications:

Interpretability Research

Primary Use Cases:

Analyzing behavior and functionality of large language models

Limitations:

Not suitable for translation or non-English text generation; not intended for deployment in human-facing interactions.

Considerations:

Text generated may be socially unacceptable or undesirable. Users should conduct risk assessments.

Additional Notes

Model checkpoints are hosted on Hugging Face as branches and are available for further fine-tuning.
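A minimal sketch of selecting one of these branches with the `transformers` library, via the `revision` argument of `from_pretrained`. The branch naming (`step0`, ten log-spaced steps up to `step512`, then every 1000 steps up to `step143000`) comes from the upstream model card rather than from this page, so treat it as an assumption and check the repository before relying on it. `checkpoint_branches` and `load_checkpoint` are hypothetical helper names.

```python
def checkpoint_branches():
    """Branch names as described in the upstream model card (assumed here)."""
    log_spaced = [2 ** i for i in range(10)]          # 1, 2, 4, ..., 512
    evenly_spaced = list(range(1000, 143001, 1000))   # 1000, 2000, ..., 143000
    return [f"step{n}" for n in [0] + log_spaced + evenly_spaced]

def load_checkpoint(revision, model_id="EleutherAI/pythia-410m-deduped"):
    """Download and load the weights and tokenizer stored on one branch."""
    # Imported lazily so that listing branch names does not require transformers.
    from transformers import AutoTokenizer, GPTNeoXForCausalLM

    model = GPTNeoXForCausalLM.from_pretrained(model_id, revision=revision)
    tokenizer = AutoTokenizer.from_pretrained(model_id, revision=revision)
    return model, tokenizer

branches = checkpoint_branches()
print(len(branches), branches[:3], branches[-1])
```

Calling `load_checkpoint("step3000")` would fetch that branch from Hugging Face; it needs network access and roughly the download size listed for the model files.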

Supported Languages

English (proficiency: High)

Training Details

Data Sources:

The Pile (globally deduplicated)

Data Volume:

825 GiB

Model Architecture:

Transformer-based

Safety Evaluation

Methodologies:

LM Evaluation Harness

Risk Categories:

Misinformation, Bias

Ethical Considerations:

The model is trained on the Pile, which is known to contain profanity and offensive text.

Responsible AI Considerations

Fairness:

The Pile contains biases related to gender, religion, and race. Users should conduct their own risk and bias assessments before deployment.

Accountability:

EleutherAI is responsible for the training and release of the model.

Mitigation Strategies:

None provided directly; users are advised to curate model outputs before presentation.

Input Output

Input Format:

Text input for causal language modeling.

Accepted Modalities:

Text

Output Format:

Generated text, produced by next-token prediction.
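To illustrate what next-token prediction means for the output, here is a toy sketch of greedy decoding in plain Python. It is not the model's implementation: `toy_logits` is a hypothetical stand-in for a real model, and the five-token vocabulary is invented for the example. Only the 2048-token context window is taken from the model's listed context length.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def greedy_generate(next_token_logits, prompt_ids, max_new_tokens, context_length=2048):
    """Repeatedly append the most probable next token to the sequence."""
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        window = ids[-context_length:]  # the model only sees the most recent tokens
        probs = softmax(next_token_logits(window))
        ids.append(max(range(len(probs)), key=probs.__getitem__))
    return ids

def toy_logits(ids):
    """Hypothetical scorer: favours (last token + 1) modulo a 5-token vocabulary."""
    vocab_size = 5
    target = (ids[-1] + 1) % vocab_size
    return [2.0 if i == target else 0.0 for i in range(vocab_size)]

generated = greedy_generate(toy_logits, [0], max_new_tokens=4)
print(generated)
```

A real causal language model replaces `toy_logits` with a forward pass over the token window, and decoding strategies other than greedy selection (sampling, beam search) change only how the next token is picked from the probabilities.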

Performance Tips:

Fine-tune appropriately; ensure model outputs are curated before use.

Release Notes

Version:

Current release

Date:

January 2023

Notes:

Renaming of models, retrained with uniform batch sizes and checkpoints.

Version:

Early release

Notes:

Initial release of models with hyperparameter discrepancies.

LLM Name	Pythia 410M Deduped
Repository 🤗	https://huggingface.co/EleutherAI/pythia-410m-deduped
Model Size	410m
Required VRAM	0.9 GB
Updated	2026-02-09
Maintainer	EleutherAI
Model Type	gpt_neox
Model Files	0.9 GB 0.9 GB
Supported Languages	en
Model Architecture	GPTNeoXForCausalLM
License	apache-2.0
Context Length	2048
Model Max Length	2048
Transformers Version	4.24.0
Tokenizer Class	GPTNeoXTokenizer
Vocabulary Size	50304
Torch Data Type	float16
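As a rough cross-check of the listed VRAM figure, the memory needed for the weights alone can be estimated from the parameter count and the data type. The sketch below assumes the nominal 410M figure from the table (the exact parameter count may differ) and two bytes per parameter for float16; `weight_memory_gb` is a hypothetical helper name.

```python
# Bytes used to store one parameter in each data type
BYTES_PER_DTYPE = {"float32": 4, "float16": 2, "bfloat16": 2}

def weight_memory_gb(num_params, dtype="float16"):
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_DTYPE[dtype] / 1e9

estimate = weight_memory_gb(410_000_000, "float16")
print(f"{estimate:.2f} GB")
```

This gives about 0.82 GB, a little under the listed 0.9 GB; the page does not say what accounts for the difference. Inference also needs memory for activations beyond the weights, so the estimate is a lower bound rather than a sizing guide.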

Best Alternatives to Pythia 410M Deduped

Best Alternatives	Context / RAM	Downloads	Likes
...thia 410M Cell Type Prediction	4K / 0 GB	1639	6
Pythia 410M Sft Full	2K / 0.8 GB	7	0
Pythia 410M	2K / 0.9 GB	38612	35
Healix 410M	2K / 1.6 GB	594	0
Pythia 410M Deduped SimPOW 0	2K / 0.8 GB	5	0
Pythia 410M Orpo	2K / 1.6 GB	5	0
...7 Kl 01 Steps 12000 Rlhf Model	2K / 1.6 GB	5	0
Pythia410m Sft Tldr	2K / 1.6 GB	296	0
Pythia 410M Ludii Sft	2K / 1.6 GB	5	0
... Llm Pythia 410M Pm Gen Ian Nd	2K / 1.6 GB	5	0



Original data from HuggingFace, OpenCompass and various public git repos.
