Open Llama 3B V2 by openlm-research

 ยป  All LLMs  ยป  openlm-research  ยป  Open Llama 3B V2   URL Share it on

  Arxiv:2302.13971   Autotrain compatible   Dataset:bigcode/starcoderdata Dataset:tiiuae/falcon-refinedw... Dataset:togethercomputer/redpa...   Endpoints compatible   Llama   Pytorch   Region:us

Open Llama 3b V2 Benchmarks

Open Llama 3B V2 (openlm-research/open_llama_3b_v2)
๐ŸŒŸ Advertise your project ๐Ÿš€

Open Llama 3B V2 Parameters and Internals

Model Type 
large language model
Additional Notes 
Please note that it is advised to avoid using the Hugging Face fast tokenizer for now, as it sometimes gives incorrect tokenizations. This can be avoided by using 'use_fast=False'.
Training Details 
Data Sources:
tiiuae/falcon-refinedweb, bigcode/starcoderdata, togethercomputer/RedPajama-Data-1T
Data Volume:
1 trillion tokens
Methodology:
Pre-trained with open datasets rather than the original LLaMA dataset, using the EasyLM framework.
Hardware Used:
cloud TPU-v4s
LLM NameOpen Llama 3b V2
Repository ๐Ÿค—https://huggingface.co/openlm-research/open_llama_3b_v2 
Model Size3b
Required VRAM6.8 GB
Updated2025-09-09
Maintaineropenlm-research
Model Typellama
Model Files  6.8 GB
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length2048
Model Max Length2048
Transformers Version4.31.0.dev0
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typefloat16

Quantized Models of the Open Llama 3B V2

Model
Likes
Downloads
VRAM
...3b V2 Python Instruct 0.1 4bit052 GB

Best Alternatives to Open Llama 3B V2

Best Alternatives
Context / RAM
Downloads
Likes
ISA 03 Mini 3B Hybrid Preview256K / 6.5 GB11324
Llama 3.2 3B Instruct128K / 6.5 GB17254281687
Llama 3.2 3B128K / 6.5 GB660308626
DeepSeek R1 Distill Llama 3B128K / 6.5 GB140716
Hermes 3 Llama 3.2 3B128K / 6.5 GB7774171
Discord Micae Hermes 3 3B128K / 6.5 GB12587
FuseChat Llama 3.2 3B Instruct128K / 6.5 GB8397
Llama 3.2 3B RP Toxic Fuse128K / 6.4 GB92
Orpheus 3B 0.1 Ft128K / 6.6 GB328867
Calme 3.1 Llamaloi 3B128K / 10.6 GB24331
Note: green Score (e.g. "73.2") means that the model is better than openlm-research/open_llama_3b_v2.

Rank the Open Llama 3B V2 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51221 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124