Mpt 125M C4 by wtang06

 ยป  All LLMs  ยป  wtang06  ยป  Mpt 125M C4   URL Share it on

Mpt 125M C4 is an open-source language model by wtang06. Features: 125m LLM, VRAM: 0.3GB, License: apache-2.0, HF Score: 28.8, LLM Explorer Score: 0.16, Arc: 22.2, HellaSwag: 26.4, MMLU: 24.7, TruthfulQA: 49.1, WinoGrande: 50.7.

  Custom code   Dataset:c4   En   Endpoints compatible   Mpt   Pytorch   Region:us
Model Card on HF ๐Ÿค—: https://huggingface.co/wtang06/mpt-125m-c4 

Mpt 125M C4 Benchmarks

Mpt 125M C4 (wtang06/mpt-125m-c4)
๐ŸŒŸ Advertise your project ๐Ÿš€

Mpt 125M C4 Parameters and Internals

Model Type 
text generation
Use Cases 
Primary Use Cases:
Generating text from a prompt
Considerations:
Intended for research purposes.
Supported Languages 
en (High proficiency)
Training Details 
Data Sources:
HuggingFace C4 dataset
Data Volume:
~2.5B tokens
Training Time:
~1 hour
Hardware Used:
104 A100-40GB GPUs
LLM NameMpt 125M C4
Repository ๐Ÿค—https://huggingface.co/wtang06/mpt-125m-c4 
Model Size125m
Required VRAM0.3 GB
Updated2026-03-29
Maintainerwtang06
Model Typempt
Model Files  0.3 GB
Supported Languagesen
Model ArchitectureMPTForCausalLM
Licenseapache-2.0
Model Max Length2048
Transformers Version4.33.1
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50368
Torch Data Typebfloat16

Best Alternatives to Mpt 125M C4

Best Alternatives
Context / RAM
Downloads
Likes
Ipt 125M0K / 0.3 GB70

Rank the Mpt 125M C4 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 52473 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum โ€” our secure, self-hosted AI agent for server management.
Release v20260328a