Falcon 7B GPTQ by openerotica

» All LLMs » openerotica » Falcon 7B GPTQ URL Share it on

Falcon 7B GPTQ is an open-source language model by openerotica. Features: 7b LLM, VRAM: 4.5GB, License: apache-2.0, Quantized, LLM Explorer Score: 0.09.

Arxiv:1911.02150 Arxiv:2005.14165 Arxiv:2101.00027 Arxiv:2104.09864 Arxiv:2205.14135 Arxiv:2306.01116 4bit Custom code Dataset:tiiuae/falcon-refinedw... En Gptq Pytorch Quantized Refinedwebmodel Region:us

Model Card on HF 🤗: https://huggingface.co/openerotica/falcon-7b-GPTQ

Falcon 7B GPTQ Benchmarks

LLME Score: 0.08802

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Falcon 7B GPTQ (openerotica/falcon-7b-GPTQ)

🌟 Advertise your project 🚀

Falcon 7B GPTQ Parameters and Internals

Model Type

Causal decoder-only

Use Cases

Areas:

Research, Commercial applications

Primary Use Cases:

Summarization, Text generation, Chatbot

Limitations:

Trained only on English and French

Considerations:

Finetuning is recommended for specific use cases; appropriate precautions should be taken for production use.

Supported Languages

English (N/A), French (N/A)

Training Details

Data Sources:

RefinedWeb-English, Books, Conversations, Code, RefinedWeb-French, Technical

Data Volume:

1,500B tokens

Methodology:

2D parallelism strategy (PP=2, DP=192) combined with ZeRO

Context Length:

2048

Training Time:

about two weeks

Hardware Used:

384 A100 40GB GPUs

Model Architecture:

Causal decoder-only with rotary positional embeddings, multiquery and FlashAttention

Input Output

Performance Tips:

Using Transformers with PyTorch 2.0 is recommended for fast inference.

LLM Name	Falcon 7B GPTQ
Repository 🤗	https://huggingface.co/openerotica/falcon-7b-GPTQ
Model Size	7b
Required VRAM	4.5 GB
Updated	2026-03-29
Maintainer	openerotica
Model Type	RefinedWebModel
Model Files	4.5 GB
Supported Languages	en
GPTQ Quantization	Yes
Quantization Type	gptq\|4bit
Model Architecture	RWForCausalLM
License	apache-2.0
Model Max Length	2048
Transformers Version	4.27.4
Is Biased	0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	65024
Torch Data Type	bfloat16

Best Alternatives to Falcon 7B GPTQ

Best Alternatives	Context / RAM	Downloads	Likes
Aguila 7B 8bit	2K / 7.7 GB	4	1
Medical Falcon 7B	0K / 7.5 GB	8	1
...sst1 En 2048 Falcon 7B V3 GPTQ	0K / 4.6 GB	8	4
Falcon 7B Instruct GPTQ	0K / 5.9 GB	207	68
...rilla Falcon 7B Hf V0 Autogptq	0K / 4.6 GB	6	1
Falcon 7B Instruct GPTQ	0K / 5.9 GB	8	4
Samantha Falcon 7B GPTQ	0K / 4.8 GB	6	12
Falcon 7B Autogptq	0K / 4.8 GB	7	2
Falcon 7B Instruct Autogptq	0K / 4.8 GB	8	1
Falcon 7B 8bit	0K / 7.2 GB	9	0

Note: green Score (e.g. "73.2") means that the model is better than openerotica/falcon-7b-GPTQ.

Rank the Falcon 7B GPTQ Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 52473 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer