| Model Type: | text generation, causal-lm |
|
| Use Cases |
| Areas: | |
| Primary Use Cases: | Research into LLMs; foundation model for NLP applications, ethics, and alignment research. |
|
|
| Supported Languages: | |
| Training Details |
| Data Sources: | |
| Data Volume: | |
| Methodology: | GPT-3 style architecture, full attention |
| Context Length: | |
| Hardware Used: | Cerebras Andromeda AI supercomputer (16 CS-2 wafer-scale systems) |
| Model Architecture: | Transformer-based, GPT-3 style |
|
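The Methodology row describes a GPT-3 style architecture with full (dense, non-sparse) attention applied under a causal mask, meaning each position can attend only to itself and earlier positions. A minimal dependency-free sketch of such a mask (illustration only, not the model's actual implementation):

```python
def causal_mask(seq_len):
    """Build a causal attention mask for a sequence of length seq_len.

    mask[i][j] is True when query position i may attend to key position j,
    i.e. when j <= i. This lower-triangular pattern is what makes a
    "full attention" Transformer causal (autoregressive).
    """
    return [[j <= i for j in range(seq_len)] for i in range(seq_len)]

mask = causal_mask(4)
# Row 0 attends only to position 0; row 3 attends to positions 0..3.
```

In practice this mask is applied by adding a large negative value to disallowed attention scores before the softmax, so future tokens contribute zero weight.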
|
| Safety Evaluation |
| Ethical Considerations: | The model was trained on the Pile dataset, which was analyzed for ethical issues such as toxicity and bias. |
|
|
| Responsible AI Considerations |
| Fairness: | Potential for distributional bias from the Pile dataset. |
|
| Accountability: | Developers are accountable for the model's outputs when using it in production. |
|
| Mitigation Strategies: | Standard Pile dataset preprocessing mitigations were employed. |
|
|
| Input Output: | |
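As a causal text-generation model, input and output are both text: the model repeatedly predicts the next token and feeds it back in. A toy sketch of that autoregressive loop, using a hypothetical fixed bigram table in place of the real Transformer (the table, vocabulary, and `greedy_generate` helper are illustrative, not part of this model):

```python
# Toy vocabulary and a stand-in "model": row i of BIGRAM_LOGITS gives
# scores for the token that follows token id i. A real causal LM would
# compute these logits with its Transformer stack.
VOCAB = ["<bos>", "hello", "world", "<eos>"]
BIGRAM_LOGITS = [
    [0.0, 2.0, 0.0, 0.0],  # after <bos>, prefer "hello"
    [0.0, 0.0, 2.0, 0.0],  # after "hello", prefer "world"
    [0.0, 0.0, 0.0, 2.0],  # after "world", prefer <eos>
    [2.0, 0.0, 0.0, 0.0],
]

def greedy_generate(start_id, max_new_tokens=8, eos_id=3):
    """Greedy autoregressive decoding: append the argmax token each step."""
    ids = [start_id]
    for _ in range(max_new_tokens):
        logits = BIGRAM_LOGITS[ids[-1]]          # next-token scores
        next_id = max(range(len(logits)), key=logits.__getitem__)
        ids.append(next_id)
        if next_id == eos_id:                    # stop at end-of-sequence
            break
    return ids

tokens = [VOCAB[i] for i in greedy_generate(0)]
```

Sampling strategies (temperature, top-k, nucleus) replace the argmax step but leave the feed-back loop unchanged.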