What are the hardware requirements for Tiiuae Falcon 40B Instruct 4 Bit Bnb?

Tiiuae Falcon 40B Instruct 4 Bit Bnb requires approximately 23.9 GB of VRAM and supports a context window of 2,048 tokens (2K). This checkpoint is already quantized to 4 bits with bitsandbytes; the Quantized Models section on this page lists a GPTQ variant at about 22 GB.

Who developed Tiiuae Falcon 40B Instruct 4 Bit Bnb and how large is it?

Tiiuae Falcon 40B Instruct 4 Bit Bnb is a 40B-parameter model maintained by SotiriosKastanas. It is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Tiiuae Falcon 40B Instruct 4 Bit Bnb?

Tiiuae Falcon 40B Instruct 4 Bit Bnb is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.
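
For a local evaluation, a minimal Python sketch along the following lines may work. It assumes the `transformers`, `accelerate`, and `bitsandbytes` packages are installed and that a GPU with roughly 24 GB of free memory is available. The helper names `load_falcon_4bit` and `generate`, and the `trust_remote_code` default, are illustrative choices and are not taken from the model card; the page tags the repository as "Custom code", so the flag may need to be set to `True` after reviewing the repository.

```python
REPO_ID = "SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-bnb"
MAX_CONTEXT_TOKENS = 2048  # context length reported on this page


def load_falcon_4bit(repo_id=REPO_ID, trust_remote_code=False):
    """Hypothetical helper: load the tokenizer and the pre-quantized model."""
    # Imported lazily so the heavy dependencies are only needed when called.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        device_map="auto",  # let accelerate place the shards on available devices
        trust_remote_code=trust_remote_code,
    )
    return tokenizer, model


def generate(tokenizer, model, prompt, max_new_tokens=64):
    """Hypothetical helper: run one prompt, keeping it inside the 2K window."""
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=MAX_CONTEXT_TOKENS - max_new_tokens,
        return_token_type_ids=False,  # generate() is not given token_type_ids
    ).to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `tokenizer, model = load_falcon_4bit()` downloads the three shards (about 23.9 GB in total) on first use, so it is worth checking disk space and GPU memory beforehand.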

Tiiuae Falcon 40B Instruct 4 Bit Bnb by SotiriosKastanas — VRAM 23.9GB, 2K context

Name: Tiiuae Falcon 40B Instruct 4 Bit Bnb
Author: SotiriosKastanas

Tiiuae Falcon 40B Instruct 4 Bit Bnb is an open-source, instruction-based language model by SotiriosKastanas. Key figures: 40B parameters, 23.9 GB VRAM, 2K context, LLM Explorer Score 0.13.

Tags: Arxiv:1910.09700, 4-bit, Bitsandbytes, Custom code, Endpoints compatible, Falcon, Instruct, Region:us, Safetensors, Sharded, Tensorflow

Model Card on HF 🤗: https://huggingface.co/SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-bnb

Tiiuae Falcon 40B Instruct 4 Bit Bnb Benchmarks

LLME Score: 0.13248

Tiiuae Falcon 40B Instruct 4 Bit Bnb Parameters and Internals

LLM Name	Tiiuae Falcon 40B Instruct 4 Bit Bnb
Repository 🤗	https://huggingface.co/SotiriosKastanas/tiiuae-falcon-40b-instruct-4-bit-bnb
Model Size	40b
Required VRAM	23.9 GB
Updated	2026-06-09
Maintainer	SotiriosKastanas
Model Type	falcon
Instruction-Based	Yes
Model Files	9.9 GB (1 of 3), 9.9 GB (2 of 3), 4.1 GB (3 of 3)
Model Architecture	FalconForCausalLM
Context Length	2048
Model Max Length	2048
Transformers Version	4.42.0.dev0
Is Biased	0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	65024
Torch Data Type	float16
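
The 23.9 GB requirement can be sanity-checked against the shard sizes in the Model Files row. The short Python sketch below sums them and compares the result with a naive estimate of 4 bits per parameter. The 40e9 parameter count is the rounded figure from this page, and the helper names are illustrative.

```python
# Shard sizes in GB, copied from the "Model Files" row above.
SHARD_SIZES_GB = [9.9, 9.9, 4.1]


def total_shard_size_gb(shards):
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(shards), 1)


def naive_weight_size_gb(num_params, bits_per_param):
    """Weights-only estimate: parameters x bits, converted to decimal GB."""
    return num_params * bits_per_param / 8 / 1e9


total = total_shard_size_gb(SHARD_SIZES_GB)
naive = naive_weight_size_gb(40e9, 4)

print(f"Sum of shards:        {total} GB")   # 23.9 GB
print(f"Naive 4-bit estimate: {naive} GB")   # 20.0 GB
```

The shard total matches the Required VRAM row. The gap to the naive estimate plausibly comes from layers kept in higher precision and from quantization metadata, although this page does not break it down.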

Quantized Models of the Tiiuae Falcon 40B Instruct 4 Bit Bnb

Model	Likes	Downloads	VRAM
...Falcon 40B Instruct 4 Bit Gptq	0	7	22 GB

Best Alternatives to Tiiuae Falcon 40B Instruct 4 Bit Bnb

Best Alternatives	Context / RAM	Downloads	Likes
Falcon 40B Instruct	0K / 83.6 GB	8473	1176
...Falcon 40B Instruct 4 Bit Gptq	2K / 22.3 GB	7	0

Original data from HuggingFace, OpenCompass and various public git repos.
