Gpt2 NoLN by apollo-research


Gpt2 NoLN is an open-source language model by apollo-research. Features: 124.4M parameters, VRAM: 0.5 GB, LLM Explorer Score: 0.15.

Tags: Arxiv:2409.13710, Deploy:azure, Endpoints compatible, Gpt2, Region:us, Safetensors


Gpt2 NoLN Parameters and Internals

Model Type: GPT2LMHeadModel
Additional Notes: To fully remove all LayerNorms, replace the 'ln_1' and 'ln_2' modules with identities, and account for 'ln_f' by adjusting the unembed matrix and bias.
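The 'ln_f' part of the note above can be illustrated with a small worked example. This is only a sketch under a stated assumption: that the final norm has already been reduced to an elementwise affine map `gamma * x + beta`, so it can be absorbed into the unembed step. The listing does not confirm that this is how the checkpoint stores 'ln_f'. All names and numbers here (`W_U`, `gamma`, `beta`, `fold_affine_into_unembed`) are hypothetical, and the identity swap for 'ln_1' and 'ln_2' is not shown.

```python
# Toy illustration in plain Python; all values are made up.
# Assumption: the final norm acts as y = gamma * x + beta (elementwise).

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def fold_affine_into_unembed(W_U, gamma, beta):
    """Return (W_folded, b_folded) such that
    W_folded @ x + b_folded == W_U @ (gamma * x + beta)."""
    # Scale each column j of W_U by gamma[j].
    W_folded = [[w * g for w, g in zip(row, gamma)] for row in W_U]
    # The constant offset beta becomes an unembed bias.
    b_folded = matvec(W_U, beta)
    return W_folded, b_folded

W_U = [[1.0, 2.0, 0.5], [0.0, -1.0, 3.0]]  # hypothetical (vocab=2, d_model=3)
gamma = [0.5, 2.0, 1.0]
beta = [0.1, -0.2, 0.0]
x = [1.0, -1.0, 2.0]

# Original path: elementwise affine, then unembed.
affine_out = [g * xi + b for g, xi, b in zip(gamma, x, beta)]
logits_before = matvec(W_U, affine_out)

# Folded path: no separate affine step, adjusted matrix plus a bias.
W_f, b_f = fold_affine_into_unembed(W_U, gamma, beta)
logits_after = [l + b for l, b in zip(matvec(W_f, x), b_f)]
```

Both paths give the same logits up to floating-point rounding. In the Hugging Face GPT-2 implementation the output head shares its weights with the input embedding and has no bias by default, so a real implementation would have to handle both; this sketch ignores that.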
Training Details
Data Sources: OpenWebText
Data Volume: ~500M tokens
Methodology: Fine-tuning with gradual LayerNorm disabling
Context Length: 1024
Release Notes
Version v2: Trained for 1000 iterations in a single training run
Version v1: Trained for 900 iterations, with multiple interruptions, modifying LNs, and resume steps
LLM Name: Gpt2 NoLN
Repository: https://huggingface.co/apollo-research/gpt2_noLN
Model Size: 124.4M
Required VRAM: 0.5 GB
Updated: 2026-05-11
Maintainer: apollo-research
Model Type: gpt2
Model Files: 0.5 GB
Model Architecture: GPT2LMHeadModel
Transformers Version: 4.42.4
Vocabulary Size: 50257
Torch Data Type: float32
Activation Function: gelu_new
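The size and VRAM figures above are consistent with a simple weights-only estimate: parameter count times bytes per parameter. The sketch below assumes that this is how the 0.5 GB figure arises and that 1 GB means 10^9 bytes; the listing does not state its method. The helper name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(num_params, bytes_per_param):
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9

params = 124.4e6  # model size from the listing

fp32_gb = weight_memory_gb(params, 4)  # float32: 4 bytes per parameter
fp16_gb = weight_memory_gb(params, 2)  # float16, shown for comparison only

print(f"float32 weights: {fp32_gb:.2f} GB")
print(f"float16 weights: {fp16_gb:.2f} GB")
```

The estimate covers weights only. Activations and any framework overhead come on top of it and depend on batch size and sequence length.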

Best Alternatives to Gpt2 NoLN

Best Alternatives
Context / RAM
Downloads
Likes
Tinystories 50900K / 0.2 GB540
Gpt2 Scratch0K / 0.5 GB100
Gpt2 Irish Folk Tune Generator0K / 0.5 GB441
My Story Generator0K / 0.5 GB282
CalmaCatLM 2 Mini0K / 0.5 GB01
Phrase To Story Generator0K / 0.5 GB50
Gpt2 Hoodie Final0K / 0.5 GB70
Autotrain Be6vh G5hv90K / 0.5 GB70
Gpt2 Sft0K / 0.5 GB50
ArshGpt0K / 0.5 GB2212

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a