What are the hardware requirements for Tiiuae Falcon 40B Instruct 4 Bit Gptq?

Tiiuae Falcon 40B Instruct 4 Bit Gptq requires approximately 22.3 GB of VRAM and supports a context window of 2K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Tiiuae Falcon 40B Instruct 4 Bit Gptq and how large is it?

Tiiuae Falcon 40B Instruct 4 Bit Gptq is developed by SotiriosKastanas, a model with 40b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Tiiuae Falcon 40B Instruct 4 Bit Gptq?

Tiiuae Falcon 40B Instruct 4 Bit Gptq is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Tiiuae Falcon 40B Instruct 4 Bit Gptq by SotiriosKastanas — VRAM 22.3GB, 2K context

Name: Tiiuae Falcon 40B Instruct 4 Bit Gptq
Author: SotiriosKastanas

Tiiuae Falcon 40B Instruct 4 Bit Gptq is an open-source language model by SotiriosKastanas. Features: 40b LLM, VRAM: 22.3GB, Context: 2K, Quantized, Instruction-Based, LLM Explorer Score: 0.13.

Arxiv:1910.09700 4-bit Custom code Endpoints compatible Falcon Gptq Instruct Quantized Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-gptq

Tiiuae Falcon 40B Instruct 4 Bit Gptq Benchmarks

LLME Score: 0.1311

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Tiiuae Falcon 40B Instruct 4 Bit Gptq (SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-gptq)

🌟 Advertise your project 🚀

Tiiuae Falcon 40B Instruct 4 Bit Gptq Parameters and Internals

LLM Name	Tiiuae Falcon 40B Instruct 4 Bit Gptq
Repository 🤗	https://huggingface.co/SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-gptq
Base Model(s)	... Falcon 40B Instruct 4 Bit Bnb SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-bnb
Model Size	40b
Required VRAM	22.3 GB
Updated	2026-05-02
Maintainer	SotiriosKastanas
Model Type	falcon
Instruction-Based	Yes
Model Files	10.0 GB: 1-of-3 9.9 GB: 2-of-3 2.4 GB: 3-of-3
GPTQ Quantization	Yes
Quantization Type	gptq
Model Architecture	FalconForCausalLM
Context Length	2048
Model Max Length	2048
Transformers Version	4.41.2
Is Biased	0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	65024
Torch Data Type	float16

Best Alternatives to Tiiuae Falcon 40B Instruct 4 Bit Gptq

Best Alternatives	Context / RAM	Downloads	Likes
... Falcon 40B Instruct 4 Bit Bnb	2K / 23.9 GB	5	0
Falcon 40B Instruct	0K / 83.6 GB	8473	1176

Note: green Score (e.g. "73.2") means that the model is better than SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-gptq.

Rank the Tiiuae Falcon 40B Instruct 4 Bit Gptq Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 54964 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Tiiuae Falcon 40B Instruct 4 Bit Gptq by SotiriosKastanas

» All LLMs » SotiriosKastanas » Tiiuae Falcon 40B Instruct 4 Bit Gptq URL Share it on

Tiiuae Falcon 40B Instruct 4 Bit Gptq Benchmarks

Tiiuae Falcon 40B Instruct 4 Bit Gptq Parameters and Internals

Best Alternatives to Tiiuae Falcon 40B Instruct 4 Bit Gptq

Rank the Tiiuae Falcon 40B Instruct 4 Bit Gptq Capabilities

What open-source LLMs or SLMs are you in search of? 54964 in total.