Pythia 410M Deduped is an open-source language model by EleutherAI. Features: 410M parameters, VRAM: 0.9GB, Context: 2K, License: apache-2.0, HF Score: 31.3, LLM Explorer Score: 0.15, ARC: 24.8, HellaSwag: 41.3, MMLU: 26, TruthfulQA: 41, WinoGrande: 54.4, GSM8K: 0.3.
Pythia 410M Deduped Parameters and Internals
Model Type: Transformer-based Language Model, Causal Language Modeling
Use Cases
Areas: Research, Scientific Experiments
Applications: Interpretability Research
Primary Use Cases: Analyzing behavior and functionality of large language models
Limitations: Not suitable for translation or non-English text generation; not intended for deployment in human-facing interactions.
Considerations: Text generated may be socially unacceptable or undesirable. Users should conduct risk assessments.
Additional Notes: Model checkpoints are hosted on Hugging Face as branches and can be used for further fine-tuning.
Supported Languages: English (high proficiency)
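Because the checkpoints are published as branches, a specific one can be selected through the `revision` argument of `from_pretrained` in the `transformers` library. The following is a minimal sketch, not an official recipe. It assumes the branch naming scheme `step<N>` described in the upstream EleutherAI model card; this page does not list which step numbers exist, so `revision_for_step` and `load_checkpoint` are illustrative helpers and the step passed in must be checked against the repository.

```python
MODEL_ID = "EleutherAI/pythia-410m-deduped"


def revision_for_step(step):
    """Build a branch name for a training step.

    Assumption: branches are named "step<N>", as in the upstream model card.
    """
    if step < 0:
        raise ValueError("step must be non-negative")
    return f"step{step}"


def load_checkpoint(step):
    """Load the model and tokenizer from the branch for a given step."""
    # Imported lazily so revision_for_step works without transformers installed.
    from transformers import AutoTokenizer, GPTNeoXForCausalLM

    revision = revision_for_step(step)
    model = GPTNeoXForCausalLM.from_pretrained(MODEL_ID, revision=revision)
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, revision=revision)
    return model, tokenizer
```

Calling `load_checkpoint(3000)` would download the weights from the branch `step3000`, provided that branch exists in the repository.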
Training Details
Data Sources: The Pile (globally deduplicated)
Data Volume:
Model Architecture:
Safety Evaluation
Methodologies:
Risk Categories:
Ethical Considerations: The model is trained on the Pile, which is known to contain profanity and offensive text.
Responsible AI Considerations
Fairness: The Pile contains biases related to gender, religion, and race. Users should conduct their own risk and bias assessments before deployment.
Accountability: EleutherAI is responsible for the training and release of the model.
Mitigation Strategies: None provided directly; users are advised to curate model outputs before presentation.
Input Output
Input Format: Text input for causal language modeling.
Accepted Modalities:
Output Format: Generated text, produced by next-token prediction.
Performance Tips: Fine-tune the model for the target task, and curate its outputs before use.
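The input and output formats above can be illustrated with a short text-completion sketch using the `transformers` library. This is one possible way to call the model, under stated assumptions: the 2,048-token window is taken from the "Context: 2K" figure on this page, greedy decoding is chosen only for reproducibility, and `fit_to_context` and `complete` are hypothetical helper names rather than part of any library.

```python
MODEL_ID = "EleutherAI/pythia-410m-deduped"
CONTEXT_LENGTH = 2048  # assumption: "Context: 2K" means a 2,048-token window


def fit_to_context(token_ids, max_new_tokens, context_length=CONTEXT_LENGTH):
    """Keep the most recent tokens so prompt plus new tokens fit the window."""
    budget = context_length - max_new_tokens
    if budget <= 0:
        raise ValueError("max_new_tokens must be smaller than the context length")
    return token_ids[-budget:]


def complete(prompt, max_new_tokens=20):
    """Continue a text prompt by repeated next-token prediction."""
    # Imported lazily so fit_to_context works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    ids = fit_to_context(tokenizer(prompt)["input_ids"], max_new_tokens)
    input_ids = torch.tensor([ids])
    with torch.no_grad():
        output = model.generate(
            input_ids,
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding: always pick the most likely token
            pad_token_id=tokenizer.eos_token_id,
        )
    # The returned sequence contains the prompt followed by the continuation.
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Since the model is not intended for human-facing deployment, any text returned by `complete` should be reviewed before it is shown to anyone.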
Release Notes
Version:
Date:
Notes: Models were renamed and retrained with uniform batch sizes and checkpoints.
Version:
Notes: Initial release of models with hyperparameter discrepancies.