Mega Ar Large 2048 Simplewiki by pszemraj

 ยป  All LLMs  ยป  pszemraj  ยป  Mega Ar Large 2048 Simplewiki   URL Share it on

Mega Ar Large 2048 Simplewiki is an open-source language model by pszemraj. Features: 216.3m LLM, VRAM: 0.9GB, License: apache-2.0, LLM Explorer Score: 0.09.

Dataset:pszemraj/simple wikipe...   Endpoints compatible   Generated from trainer   Mega   Pytorch   Region:us   Safetensors

Mega Ar Large 2048 Simplewiki Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Mega Ar Large 2048 Simplewiki (pszemraj/mega-ar-large-2048-simplewiki)
๐ŸŒŸ Advertise your project ๐Ÿš€

Mega Ar Large 2048 Simplewiki Parameters and Internals

Model Type 
autoregressive, text-generation
Additional Notes 
This model was trained from random weights and trained for three epochs using Simple Wikipedia.
Training Details 
Data Sources:
pszemraj/simple_wikipedia_LM
LLM NameMega Ar Large 2048 Simplewiki
Repository ๐Ÿค—https://huggingface.co/pszemraj/mega-ar-large-2048-simplewiki 
Base Model(s)  pszemraj/random-mega-ar-large   pszemraj/random-mega-ar-large
Model Size216.3m
Required VRAM0.9 GB
Updated2026-03-28
Maintainerpszemraj
Model Typemega
Model Files  0.9 GB   0.9 GB   0.0 GB
Model ArchitectureMegaForCausalLM
Licenseapache-2.0
Model Max Length2048
Transformers Version4.33.1
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50277
Torch Data Typefloat32

Rank the Mega Ar Large 2048 Simplewiki Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 52052 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum โ€” our secure, self-hosted AI agent for server management.
Release v20260327b