Name: Bert Small
Author: prajjwal1

Bert Small is an open-source language model by prajjwal1. Features: LLM, VRAM: 0.1GB, Context: 0.5K, License: mit, LLM Explorer Score: 0.04.

Arxiv:1908.08962 Arxiv:2110.01518 Bert En Endpoints compatible Mnli Nli Pre-training Pytorch Region:us

Model Card on HF 🤗: https://huggingface.co/prajjwal1/bert-small

Bert Small Benchmarks

LLME Score: 0.04012

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

🌟 Advertise your project 🚀

Bert Small Parameters and Internals

Model Type

Text Transformer, NLI

Use Cases

Areas:

Natural Language Inference

Applications:

Research, Commercial NLP applications

Primary Use Cases:

Supposed to be trained on downstream tasks such as NLI

Limitations:

Model size is relatively small, hence may not perform as well as its larger counterparts on tasks that require significant model capacity.

Considerations:

Developers should focus on tasks that can leverage the compact size of the model for efficiency.

Additional Notes

Original implementation and additional information can be found in the specified GitHub repository.

Training Details

Data Sources:

Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics

Methodology:

Condensed and compact pre-training followed by downstream task training

Model Architecture:

L=4, H=512

LLM Name	Bert Small
Repository 🤗	https://huggingface.co/prajjwal1/bert-small
Required VRAM	0.1 GB
Updated	2026-03-29
Maintainer	prajjwal1
Model Files	0.1 GB
Supported Languages	en
Model Architecture	AutoModel
License	mit
Context Length	512
Model Max Length	512
Vocabulary Size	30522

Best Alternatives to Bert Small

Best Alternatives	Context / RAM	Downloads	Likes
Distil Longformer Base 4096	4K / 0.4 GB	36	0
Daedalus 1	1K / GB	25	1
Tiny Random Detr	1K / 0.2 GB	21	0
Opengpt2 Pytorch Backward	1K / 6 GB	4	1
Opengpt2 Pytorch Forward	1K / 6 GB	4	1
Finsent Transformer	0.5K / 0.4 GB	0	1
Simbert Chinese Base	0.5K / 0.4 GB	6	0
Bert Chinese L 12 H 768 A 12	0.5K / 0.4 GB	1	1
Simbert Chinese Tiny	0.5K / 0 GB	5	0
Bert Tiny	0.5K / 0 GB	784113	139

Note: green Score (e.g. "73.2") means that the model is better than prajjwal1/bert-small.

Rank the Bert Small Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 52473 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Bert Small by prajjwal1

» All LLMs » prajjwal1 » Bert Small URL Share it on

Bert Small Benchmarks

Bert Small Parameters and Internals

Best Alternatives to Bert Small

Rank the Bert Small Capabilities

What open-source LLMs or SLMs are you in search of? 52473 in total.