Bert Small by prajjwal1

 ยป  All LLMs  ยป  prajjwal1  ยป  Bert Small   URL Share it on

Bert Small is an open-source language model by prajjwal1. Features: LLM, VRAM: 0.1GB, Context: 0.5K, License: mit, LLM Explorer Score: 0.04.

  Arxiv:1908.08962   Arxiv:2110.01518   Bert   En   Endpoints compatible   Mnli   Nli   Pre-training   Pytorch   Region:us
Model Card on HF ๐Ÿค—: https://huggingface.co/prajjwal1/bert-small 

Bert Small Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Bert Small (prajjwal1/bert-small)
๐ŸŒŸ Advertise your project ๐Ÿš€

Bert Small Parameters and Internals

Model Type 
Text Transformer, NLI
Use Cases 
Areas:
Natural Language Inference
Applications:
Research, Commercial NLP applications
Primary Use Cases:
Supposed to be trained on downstream tasks such as NLI
Limitations:
Model size is relatively small, hence may not perform as well as its larger counterparts on tasks that require significant model capacity.
Considerations:
Developers should focus on tasks that can leverage the compact size of the model for efficiency.
Additional Notes 
Original implementation and additional information can be found in the specified GitHub repository.
Training Details 
Data Sources:
Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Methodology:
Condensed and compact pre-training followed by downstream task training
Model Architecture:
L=4, H=512
LLM NameBert Small
Repository ๐Ÿค—https://huggingface.co/prajjwal1/bert-small 
Required VRAM0.1 GB
Updated2026-03-29
Maintainerprajjwal1
Model Files  0.1 GB
Supported Languagesen
Model ArchitectureAutoModel
Licensemit
Context Length512
Model Max Length512
Vocabulary Size30522

Best Alternatives to Bert Small

Best Alternatives
Context / RAM
Downloads
Likes
Distil Longformer Base 40964K / 0.4 GB360
Daedalus 11K /  GB251
Tiny Random Detr1K / 0.2 GB210
Opengpt2 Pytorch Backward1K / 6 GB41
Opengpt2 Pytorch Forward1K / 6 GB41
Finsent Transformer0.5K / 0.4 GB01
Simbert Chinese Base0.5K / 0.4 GB60
Bert Chinese L 12 H 768 A 120.5K / 0.4 GB11
Simbert Chinese Tiny0.5K / 0 GB50
Bert Tiny0.5K / 0 GB784113139
Note: green Score (e.g. "73.2") means that the model is better than prajjwal1/bert-small.

Rank the Bert Small Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 52473 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum โ€” our secure, self-hosted AI agent for server management.
Release v20260328a