Dlite V2 124M by aisquared

 ยป  All LLMs  ยป  aisquared  ยป  Dlite V2 124M   URL Share it on

Dlite V2 124M is an open-source language model by aisquared. Features: 124m LLM, VRAM: 0.3GB, License: apache-2.0, HF Score: 28.3, LLM Explorer Score: 0.13, Arc: 24, HellaSwag: 31.1, MMLU: 25.3, TruthfulQA: 39, WinoGrande: 50.4.

Dataset:aisquared/databricks-d...   En   Endpoints compatible   Gpt2   Pytorch   Region:us
Model Card on HF ๐Ÿค—: https://huggingface.co/aisquared/dlite-v2-124m 

Dlite V2 124M Benchmarks

Dlite V2 124M (aisquared/dlite-v2-124m)
๐ŸŒŸ Advertise your project ๐Ÿš€

Dlite V2 124M Parameters and Internals

Model Type 
Large Language Model
Use Cases 
Limitations:
Factual inaccuracies, Biases, Offensive responses, Toxicity, Hallucinations
Considerations:
Exercise good judgment when applying this technology.
Additional Notes 
DLite is an experimental technology and is not designed for use in any environment without significant testing and safety consideration.
Supported Languages 
EN (NLP)
Training Details 
Data Sources:
Databricks' 'Dolly 15k' Dataset
Data Volume:
15k records
Methodology:
Fine-tuned on a single GPU
Hardware Used:
Single GPU
Input Output 
Performance Tips:
Including torch_dtype=torch.bfloat16 is generally recommended.
LLM NameDlite V2 124M
Repository ๐Ÿค—https://huggingface.co/aisquared/dlite-v2-124m 
Model Size124m
Required VRAM0.3 GB
Updated2026-03-29
Maintaineraisquared
Model Typegpt2
Model Files  0.3 GB   0.0 GB
Supported Languagesen
Model ArchitectureGPT2LMHeadModel
Licenseapache-2.0
Model Max Length1024
Transformers Version4.25.1
Tokenizer ClassGPT2Tokenizer
Vocabulary Size50260
Torch Data Typefloat32
Activation Functiongelu_new

Best Alternatives to Dlite V2 124M

Best Alternatives
Context / RAM
Downloads
Likes
BrtGPT 124M Base0K / 0.4 GB111
GPT2 124M Poetry RL0K / 0.5 GB62
Gpt2 Final0K / 0.5 GB60
Filiberto 124M0K / 0.5 GB60
GPT2 Nepali 124M0K / 0.5 GB295
GPT2 Nepali 124M0K / 0.5 GB235
LaMini GPT 124M0K / 0.5 GB383523
Dlite V1 124M0K / 0.5 GB26270
Note: green Score (e.g. "73.2") means that the model is better than aisquared/dlite-v2-124m.

Rank the Dlite V2 124M Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 52721 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum โ€” our secure, self-hosted AI agent for server management.
Release v20260328a