Llama 2 7B GPTQ is an open-source language model by TheBloke. Features: 7b LLM, VRAM: 3.9GB, Context: 4K, License: llama2, Quantized, HF Score: 48.5, LLM Explorer Score: 0.15, Arc: 52.1, HellaSwag: 77.6, MMLU: 44, TruthfulQA: 39.3, WinoGrande: 72.9, GSM8K: 5.
Llama 2 7B GPTQ Benchmarks
[Benchmark chart: how the model compares to the reference models Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").]
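The HF Score in the summary line appears to be the unweighted mean of the six listed benchmark scores. The page does not state the formula, so treat this as an assumption; the sketch below only checks that the arithmetic is consistent with the listed values.

```python
# Benchmark scores copied from the summary line of this page.
scores = {
    "ARC": 52.1,
    "HellaSwag": 77.6,
    "MMLU": 44.0,
    "TruthfulQA": 39.3,
    "WinoGrande": 72.9,
    "GSM8K": 5.0,
}


def mean_score(values):
    """Unweighted arithmetic mean, rounded to one decimal place."""
    values = list(values)
    return round(sum(values) / len(values), 1)


# Assumption: the HF Score is a plain average of the six benchmarks.
hf_score = mean_score(scores.values())
print(hf_score)  # 48.5, matching the listed HF Score
```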
Llama 2 7B GPTQ Parameters and Internals
Model Type
Use Cases
Primary Use Cases: Natural language generation tasks; dialogue use cases for fine-tuned variants.
Limitations: Tested primarily in English and may not work predictably for other languages; potential for inaccurate or biased outputs.
Considerations: Follow Meta’s Responsible Use Guide.
Additional Notes: The model's fine-tuned variants (Llama-2-Chat) are optimized for dialogue applications and show improved safety and helpfulness.
Training Details
Data Volume:
Methodology: Pretraining on publicly available online data, followed by supervised fine-tuning and reinforcement learning from human feedback (RLHF) for the fine-tuned variants.
Context Length:
Training Time: January 2023 to July 2023
Hardware Used: Meta's Research Super Cluster, A100-80GB GPUs
Model Architecture: Auto-regressive language model using an optimized transformer design.
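As a rough illustration of what "auto-regressive" means here, the sketch below generates one token at a time and feeds each new token back in as input. The lookup table `NEXT_TOKEN` is a hypothetical stand-in for the model: a real transformer conditions on the whole prefix and produces a probability distribution over its vocabulary, not a fixed successor.

```python
# Hypothetical toy "model": maps the last token to a single successor.
# Only the decoding loop is the point; the table is made up for illustration.
NEXT_TOKEN = {"<s>": "the", "the": "cat", "cat": "sat", "sat": "</s>"}


def greedy_decode(prompt, max_new_tokens=8):
    """Append one token per step until the end-of-sentence token or the limit."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        next_token = NEXT_TOKEN.get(tokens[-1], "</s>")
        tokens.append(next_token)  # the output becomes part of the next input
        if next_token == "</s>":
            break
    return tokens


generated = greedy_decode(["<s>"])
print(generated)  # ['<s>', 'the', 'cat', 'sat', '</s>']
```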
Responsible AI Considerations
Mitigation Strategies: Perform safety testing and tuning specific to applications.
LLM Name: Llama 2 7B GPTQ
Repository: 🤗 https://huggingface.co/TheBloke/Llama-2-7B-GPTQ
Model Name: Llama 2 7B
Model Creator: Meta
Base Model(s): Llama 2 7B Hf (meta-llama/Llama-2-7b-hf)
Model Size: 7b
Required VRAM: 3.9 GB
Updated: 2026-04-26
Maintainer: TheBloke
Model Type: llama
Model Files: 3.9 GB
Supported Languages: en
GPTQ Quantization: Yes
Quantization Type: gptq
Model Architecture: LlamaForCausalLM
License: llama2
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.30.2
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: float16
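A minimal sketch of how the repository and context length listed above might be used together. It assumes a recent `transformers` release with GPTQ support installed (the `optimum`, `auto-gptq`, and `accelerate` packages) and a CUDA GPU; the page itself does not describe a loading procedure, and the 4.30.2 version listed above may predate this loading path. The helper `clamp_new_tokens` and the wrapper `generate` are hypothetical names introduced for illustration.

```python
MODEL_ID = "TheBloke/Llama-2-7B-GPTQ"  # repository listed above
CONTEXT_LENGTH = 4096                  # context length listed above


def clamp_new_tokens(prompt_tokens, requested_new_tokens,
                     context_length=CONTEXT_LENGTH):
    """Return how many tokens can be generated without exceeding the context window."""
    remaining = context_length - prompt_tokens
    return max(0, min(requested_new_tokens, remaining))


def generate(prompt, requested_new_tokens=128):
    """Hypothetical wrapper: load the quantized model and complete a prompt."""
    # Imported lazily so clamp_new_tokens can be used without these packages.
    # Assumption: transformers with GPTQ support and a CUDA-capable GPU.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    budget = clamp_new_tokens(inputs["input_ids"].shape[1], requested_new_tokens)
    if budget == 0:
        raise ValueError("Prompt already fills the context window.")

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("The capital of France is")` would download the model files on first use. Because this is the base model rather than a Llama-2-Chat variant, it continues the text instead of following instructions.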