Kitten 110M by qikp


Kitten 110M is an open-source language model by qikp. Features: 111M parameters, VRAM: 0.4 GB, License: apache-2.0.
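The VRAM figure can be sanity-checked with simple arithmetic: weight memory is roughly the parameter count times the bytes per parameter. The sketch below assumes "111M" means about 111 million parameters. The helper name `weight_memory_gb` and the dtype table are illustrative and not part of the listing. Which precision the listing's 0.4 GB figure refers to is not stated; the float32 estimate happens to be consistent with it.

```python
# Rough weight-memory estimate: parameter count times bytes per parameter.
# Assumption: the listing's model size means about 111 million parameters.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_gb(num_params: int, dtype: str = "float32") -> float:
    """Return the approximate size of the weights in GB (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    params = 111_000_000  # hypothetical exact count, used for illustration
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gb(params, dtype):.2f} GB")
```

This counts weights only. Activations and the key/value cache used during generation add to the total, so actual usage will be somewhat higher.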

Tags: Base model:cerebras/cerebras-g..., Base model:finetune:cerebras/c..., Dataset:huggingfacetb/cosmoped..., En, Gpt2, Region:us, Safetensors
Model Card on HF: https://huggingface.co/qikp/kitten-110m
Kitten 110M (qikp/kitten-110m)

Kitten 110M Parameters and Internals

LLM Name: Kitten 110M
Repository: https://huggingface.co/qikp/kitten-110m
Base Model(s): Cerebras GPT 111M (cerebras/Cerebras-GPT-111M)
Model Size: 111M
Required VRAM: 0.4 GB
Updated: 2026-03-31
Maintainer: qikp
Model Type: gpt2
Model Files: 0.4 GB, 0.0 GB
Supported Languages: en
Model Architecture: GPT2LMHeadModel
License: apache-2.0
Transformers Version: 5.0.0
Tokenizer Class: GPT2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 50257
Activation Function: gelu
Errors: replace
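Given the architecture and tokenizer listed above, the model should load through the generic Hugging Face `transformers` auto classes. The following is a minimal sketch, not the maintainer's documented usage: it assumes `transformers` and PyTorch are installed and that the repository is reachable. The function name `generate`, the prompt, and the generation settings are illustrative choices.

```python
MODEL_ID = "qikp/kitten-110m"  # repository name taken from the listing


def generate(prompt: str, max_new_tokens: int = 40) -> str:
    """Download the model on first use and return a text continuation."""
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        # Pad with the end-of-sequence token, which for GPT-2 tokenizers
        # is <|endoftext|>, the padding token named in the listing.
        pad_token_id=tokenizer.eos_token_id,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("Once upon a time"))
```

The auto classes resolve to the concrete classes from the repository's configuration, so the sketch does not need to name `GPT2LMHeadModel` or `GPT2Tokenizer` directly.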

Best Alternatives to Kitten 110M

Model                            Context / RAM    Downloads / Likes
Testmodel                        0K / 0.3 GB      18801
111M                             0K / 0.3 GB      18612
Cerebras GPT 111M Instruction    0K / 0.4 GB      283
LaMini Cerebras 111M             0K / 0.5 GB      493
Note: green Score (e.g. "73.2") means that the model is better than qikp/kitten-110m.


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a