What are the hardware requirements for Llama 13B 3bit Gr128?

Llama 13B 3bit Gr128 requires approximately 5.9 GB of VRAM. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Llama 13B 3bit Gr128 and how large is it?

Llama 13B 3bit Gr128 is developed by 4bit and has 13B parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Llama 13B 3bit Gr128?

Llama 13B 3bit Gr128 is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.
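
As a minimal sketch of fetching the files, the repository can be pulled with the huggingface_hub library's snapshot_download function. The repository ID comes from the model card URL on this page; the helper names and the local directory name are hypothetical, and the sketch assumes huggingface_hub is installed and the repository is publicly accessible.

```python
REPO_ID = "4bit/llama-13b-3bit-gr128"  # taken from the model card URL on this page


def model_card_url(repo_id: str = REPO_ID) -> str:
    """Build the Hugging Face model card URL for a repository ID."""
    return f"https://huggingface.co/{repo_id}"


def download_model(local_dir: str = "llama-13b-3bit-gr128") -> str:
    """Download every file in the repository and return the local path.

    local_dir is an illustrative default, not something the page specifies.
    """
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


print(model_card_url())
# To fetch the weights (roughly 5.9 GB according to this page):
# path = download_model()
```

The download call is left commented out so the snippet can be run without starting a multi-gigabyte transfer.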

Llama 13B 3bit Gr128 by 4bit — VRAM 5.9GB

Name: Llama 13B 3bit Gr128
Author: 4bit

Llama 13B 3bit Gr128 is an open-source language model by 4bit. Features: 13b LLM, VRAM: 5.9GB, Quantized, LLM Explorer Score: 0.06.

Tags: 3bit, Endpoints compatible, Llama, Quantized, Region:us

Model Card on HF 🤗: https://huggingface.co/4bit/llama-13b-3bit-gr128

Llama 13B 3bit Gr128 Benchmarks

LLME Score: 0.06349

Llama 13B 3bit Gr128 Parameters and Internals

Additional Notes

The model was generated with the following settings: --wbits 4 --groupsize 128 --true-sequential --new-eval --faster-kernel.
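
To illustrate what a group size of 128 means, here is a minimal sketch of group-wise quantization in plain Python. It uses simple round-to-nearest, not GPTQ: GPTQ additionally adjusts the remaining weights to compensate for rounding error, which this sketch omits. The function names are hypothetical, the random weights are made-up test data, and the default bit width of 3 simply follows the model name.

```python
import random


def quantize_groupwise(weights, bits=3, group_size=128):
    """Round-to-nearest quantization with one (scale, minimum) pair per group."""
    levels = (1 << bits) - 1  # 3 bits -> integer codes 0..7
    groups = []
    for start in range(0, len(weights), group_size):
        chunk = weights[start:start + group_size]
        lo, hi = min(chunk), max(chunk)
        scale = (hi - lo) / levels or 1.0  # fall back to 1.0 for a constant group
        # Clamp so floating-point rounding can never produce an out-of-range code.
        codes = [min(levels, max(0, round((w - lo) / scale))) for w in chunk]
        groups.append((scale, lo, codes))
    return groups


def dequantize_groupwise(groups):
    """Map integer codes back to approximate floating-point weights."""
    return [lo + scale * c for scale, lo, codes in groups for c in codes]


random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(512)]  # illustrative data only
groups = quantize_groupwise(weights)
restored = dequantize_groupwise(groups)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(f"{len(groups)} groups, max abs error {max_error:.5f}")
```

Smaller groups track local weight ranges more closely at the cost of storing more scale and minimum values, which is the trade-off the group size setting controls.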

LLM Name	Llama 13B 3bit Gr128
Repository 🤗	https://huggingface.co/4bit/llama-13b-3bit-gr128
Model Size	13b
Required VRAM	5.9 GB
Updated	2026-07-08
Maintainer	4bit
Model Type	llama
Model Files	5.9 GB
Quantization Type	3bit
Model Architecture	LLaMAForCausalLM
Transformers Version	4.27.0.dev0
Tokenizer Class	LlamaTokenizer
Vocabulary Size	32000
Torch Data Type	float16
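
The listed size can be sanity-checked with a rough back-of-the-envelope estimate. The sketch below takes the bit width, vocabulary size, and float16 data type from the table above and the group size from the model name. The layer dimensions are assumptions based on the published LLaMA-13B architecture and do not appear on this page, and the storage layout (one float16 scale and one packed zero point per group, embeddings left in float16) is also assumed.

```python
# Assumed LLaMA-13B dimensions (published architecture, not stated on this page).
HIDDEN = 5120
INTERMEDIATE = 13824
LAYERS = 40

VOCAB = 32000      # "Vocabulary Size" in the table
BITS = 3           # "Quantization Type" in the table
GROUP_SIZE = 128   # "Gr128" in the model name
FP16_BYTES = 2     # "Torch Data Type" float16


def estimate_bytes() -> float:
    """Estimate the on-disk size of the quantized checkpoint in bytes."""
    attention = 4 * HIDDEN * HIDDEN      # q, k, v, o projections
    mlp = 3 * HIDDEN * INTERMEDIATE      # gate, up, down projections
    quantized_params = LAYERS * (attention + mlp)
    groups = quantized_params // GROUP_SIZE

    packed_weights = quantized_params * BITS / 8
    scales = groups * FP16_BYTES         # assumed: one float16 scale per group
    zeros = groups * BITS / 8            # assumed: one packed zero point per group
    # Assumed: token embedding and output head stay unquantized in float16.
    unquantized = 2 * VOCAB * HIDDEN * FP16_BYTES
    return packed_weights + scales + zeros + unquantized


print(f"Estimated size: {estimate_bytes() / 1e9:.2f} GB")
```

Under these assumptions the estimate comes to about 5.65 GB, in the same range as the 5.9 GB listed above. The sketch does not model file-format or packing overhead, and it says nothing about the activation memory needed at inference time.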

Best Alternatives to Llama 13B 3bit Gr128

Best Alternatives	Context / RAM	Downloads	Likes
... X Alpaca 13B Native 4bit 128g	0K / 7.9 GB	167	728
... X Alpaca 13B Native 4bit 128g	0K / 8.1 GB	4	2
Llama 13B 4bit Hf	0K / 7 GB	6	2
Llama 13B 4bit Gr128	0K / 7.5 GB	5	2
... X Alpaca 13B Native 4bit 128g	0K / 7.9 GB	28	3
Llama 13B 4bit Gr128	0K / 7.5 GB	7	5
Llama 13B 3bit Gr128	0K / 5.9 GB	4	3
Llm Jp 13B V2.0	4K / 27.4 GB	152	15
Swallow 13B GPTQ	4K / 7.5 GB	8	0
Alpaca 13B	0K / 52.1 GB	5176	108

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20260328a
