What are the hardware requirements for Llama 2 7B Hf?

Llama 2 7B Hf requires approximately 13.5 GB of VRAM and supports a context window of 4K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Llama 2 7B Hf and how large is it?

Llama 2 7B Hf is developed by meta-llama, a model with 7b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

How does Llama 2 7B Hf perform on standard benchmarks?

Llama 2 7B Hf has the following published scores: MMLU 43.8. Compare against reference models on this page or on the LLM Explorer leaderboards.

Llama 2 7B Hf by meta-llama — VRAM 13.5GB, 4K context

Name: Llama 2 7B Hf
Author: meta-llama

Llama 2 7B Hf is an open-source language model by meta-llama. Features: 7b LLM, VRAM: 13.5GB, Context: 4K, License: llama2, HF Score: 48.9, LLM Explorer Score: 0.29, Arc: 53.1, HellaSwag: 77.7, MMLU: 43.8, TruthfulQA: 39, WinoGrande: 74.6, GSM8K: 5.4.

Arxiv:2307.09288 En Endpoints compatible Facebook Llama Llama2 Meta Pytorch Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/meta-llama/Llama-2-7b-hf

Llama 2 7B Hf Benchmarks

IFEval: 25.19 vs 88 (so35)^-71.4%

ARC: 53.07 vs 96.7 (so35)^-45.1%

HellaSwag: 77.74 vs 95.3 (gpt4)^-18.4%

MMLU: 43.8 vs 88.3 (so35)^-50.4%

TruthfulQA: 38.98 vs 59 (gpt4)^-33.9%

WinoGrande: 74.59 vs 87.5 (gpt4)^-14.8%

GSM8K: 5.38 vs 96.4 (so35)^-94.4%

MATH Lvl 5: 1.74

LLME Score: 0.2898

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Llama 2 7B Hf (meta-llama/Llama-2-7b-hf)

🌟 Advertise your project 🚀

Llama 2 7B Hf Parameters and Internals

Model Type

Generative text

Use Cases

Areas:

Commercial, Research

Applications:

Assistant-like chat

Primary Use Cases:

Natural language generation tasks

Limitations:

Use cases not covered extensively in languages other than English.

Considerations:

Developers should ensure the responsible use of models.

Additional Notes

Tuned models optimized for dialogue. High carbon footprint during pretraining offset by Meta's sustainability program. Modeled potential relationships between text sequences to predict next items in sequences safely and effectively.

Supported Languages

English (Primary language for intended use)

Training Details

Data Sources:

Publicly available online data

Data Volume:

2 trillion tokens

Methodology:

Uses a mix of publicly available online data. Fine-tuned using Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF).

Context Length:

4000

Hardware Used:

A100-80GB (TDP of 350-400W)

Model Architecture:

Optimized transformer architecture

Responsible Ai Considerations

Fairness:

Model may produce inaccurate, biased, or objectionable outputs.

Transparency:

Transparency measures are in place for users.

Accountability:

Developers should perform safety testing tailored to specific applications.

Mitigation Strategies:

Safety testing and tuning recommended by Meta before deployment.

Input Output

Input Format:

Text only

Accepted Modalities:

Text

Output Format:

Text only

Performance Tips:

Specific formatting needs for chat versions, including the use of `INST` and `<>` tags, `BOS` and `EOS` tokens, and appropriate whitespace management.

LLM Name	Llama 2 7B Hf
Repository 🤗	https://huggingface.co/meta-llama/Llama-2-7b-hf
Model Size	7b
Required VRAM	13.5 GB
Updated	2026-04-09
Maintainer	meta-llama
Model Type	llama
Model Files	10.0 GB: 1-of-2 3.5 GB: 2-of-2 10.0 GB: 1-of-2 3.5 GB: 2-of-2
Supported Languages	en
Model Architecture	LlamaForCausalLM
License	llama2
Context Length	4096
Model Max Length	4096
Transformers Version	4.31.0.dev0
Tokenizer Class	LlamaTokenizer
Beginning of Sentence Token	<s>
End of Sentence Token	</s>
Unk Token	<unk>
Vocabulary Size	32000
Torch Data Type	float16

Quantized Models of the Llama 2 7B Hf

Model	Likes	Downloads	VRAM
Llama 2 7B GPTQ	81	7637	3 GB
Llama 2 7B GGUF	208	7812	2 GB
Llama 2 7B AWQ	17	2732	3 GB
Llama 2 7B Hf Codealpaca 4bit	1	7	0 GB
Nb Sau 7B 8K Step100k	1	8	13 GB
Llama 2 7B GGML	219	778	2 GB
Llama 2 Medical Consultation	4	6	0 GB

Best Alternatives to Llama 2 7B Hf

Best Alternatives	Context / RAM	Downloads
124	1024K / 16.1 GB	93
162	1024K / 16.1 GB	60
157	1024K / 16.1 GB	101
118	1024K / 16.1 GB	15
A5.4	1024K / 16.1 GB	12
A3.4	1024K / 16.1 GB	13
A6 L	1024K / 16.1 GB	201
A2.4	1024K / 16.1 GB	12
M	1024K / 16.1 GB	127
2 Very Sci Fi	1024K / 16.1 GB	317

Rank the Llama 2 7B Hf Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 53348 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Llama 2 7B Hf by meta-llama