Falcon 180B GPTQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  Falcon 180B GPTQ   URL Share it on

  Arxiv:1911.02150   Arxiv:2005.14165   Arxiv:2101.00027   Arxiv:2104.09864   Arxiv:2205.14135   Arxiv:2306.01116   4-bit   Autotrain compatible Base model:quantized:tiiuae/fa...   Base model:tiiuae/falcon-180b Dataset:tiiuae/falcon-refinedw...   De   En   Es   Falcon   Fr   Gptq   Quantized   Region:us   Safetensors   Sharded   Tensorflow

Falcon 180B GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Falcon 180B GPTQ (TheBloke/Falcon-180B-GPTQ)
๐ŸŒŸ Advertise your project ๐Ÿš€

Falcon 180B GPTQ Parameters and Internals

Model Type 
Causal decoder-only
Use Cases 
Areas:
Research on large language models, Foundation for further specialization and finetuning
Applications:
Summarization, Text generation, Chatbots
Limitations:
Production use without adequate assessment of risks, Use cases considered irresponsible or harmful
Considerations:
Finetuning recommended for specific tasks, consider risks and necessary precautions for production use.
Supported Languages 
en (English), de (German), es (Spanish), fr (French)
Training Details 
Data Sources:
RefinedWeb, Books, Conversations, Code, Technical
Data Volume:
3,500B tokens
Context Length:
2048
Training Time:
early 2023
Hardware Used:
4,096 A100 40GB GPUs
Model Architecture:
Causal decoder-only model adapted from GPT-3
LLM NameFalcon 180B GPTQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/Falcon-180B-GPTQ 
Model NameFalcon 180B
Model CreatorTechnology Innovation Institute
Base Model(s)  Falcon 180B   tiiuae/falcon-180B
Model Size180b
Required VRAM94.1 GB
Updated2025-08-20
MaintainerTheBloke
Model Typefalcon
Model Files  10.0 GB: 1-of-10   9.9 GB: 2-of-10   9.9 GB: 3-of-10   9.7 GB: 4-of-10   9.9 GB: 5-of-10   9.7 GB: 6-of-10   9.9 GB: 7-of-10   9.7 GB: 8-of-10   9.9 GB: 9-of-10   5.5 GB: 10-of-10
Supported Languagesen de es fr
GPTQ QuantizationYes
Quantization Typegptq
Model ArchitectureFalconForCausalLM
Licenseunknown
Model Max Length2048
Transformers Version4.32.0
Is Biased0
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size65024
Torch Data Typebfloat16

Best Alternatives to Falcon 180B GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
Falcon 180B Chat GPTQ2K / 94.1 GB1169
Largefalcon2K / 411.4 GB50
...buddy Falcon 180B V13 Preview02K / 358.1 GB17612
...buddy Falcon 180B V12 Preview02K / 358.1 GB17650
Airoboros 180B 2.2.12K / 154.2 GB160517
Airoboros 180B 2.2.1 AWQ2K / 96 GB96
...buddy Falcon 180B V13 Preview22K / 358.1 GB71
...buddy Falcon 180B V13 Preview12K / 358.1 GB104
Falcon 180B Chat AWQ2K / 96 GB88
...alcon 180B Omniquant W3a16g5122K / 69.4 GB53
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Falcon-180B-GPTQ.

Rank the Falcon 180B GPTQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50767 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124