Llama2 20M Init by emozilla

 ยป  All LLMs  ยป  emozilla  ยป  Llama2 20M Init   URL Share it on

  Arxiv:1910.09700   Autotrain compatible   Endpoints compatible   Llama   Region:us   Safetensors

Llama2 20M Init Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Llama2 20M Init (emozilla/llama2-20m-init)
๐ŸŒŸ Advertise your project ๐Ÿš€

Llama2 20M Init Parameters and Internals

LLM NameLlama2 20M Init
Repository ๐Ÿค—https://huggingface.co/emozilla/llama2-20m-init 
Model Size20m
Required VRAM0 GB
Updated2025-09-08
Maintaineremozilla
Model Typellama
Model Files  0.0 GB
Model ArchitectureLlamaForCausalLM
Context Length2048
Model Max Length2048
Transformers Version4.40.1
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to Llama2 20M Init

Best Alternatives
Context / RAM
Downloads
Likes
Internlm2 5 20B Llamafied256K / 39.9 GB11275
Internlm2 20B Llama32K / 39.6 GB169020
Stellaris Internlm2 20B R51232K / 39.8 GB83
Internlm2 Chat 20B Llama Old32K / 39.6 GB123
Internlm2 Base 20B Llama32K / 39.6 GB63
Internlm2 Base 20B Llama32K / 39.6 GB100
Deita 20B32K / 39.8 GB61
Bagel 20B V04 Llama32K / 39.6 GB197
Bagel DPO 20B V04 Llama32K / 39.6 GB173
Internlm2 Limarp Chat 20B32K / 39.6 GB73
Note: green Score (e.g. "73.2") means that the model is better than emozilla/llama2-20m-init.

Rank the Llama2 20M Init Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51187 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124