Falcon 40B by tiiuae


Falcon 40B is an open-source language model by tiiuae. Features: 40B parameters, VRAM: 83.6 GB, License: apache-2.0, LLM Explorer Score: 0.23, ARC: 61.9, HellaSwag: 85.3, MMLU: 56.9, GSM8K: 21.5.
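The 83.6 GB VRAM figure can be sanity-checked against the shard sizes listed under Model Files on this page (eight files of 9.5 GB and one of 7.6 GB). The sketch below sums one set of shards and, as a separate rough estimate, multiplies a nominal 40 billion parameters by 2 bytes for bfloat16. The function names are illustrative, and treating 1 GB as 10^9 bytes is an assumption.

```python
# Shard sizes in GB as listed under "Model Files" (one set of nine shards).
SHARD_SIZES_GB = [9.5] * 8 + [7.6]


def total_checkpoint_gb(shards):
    """Sum the shard sizes, rounded to one decimal like the page's figures."""
    return round(sum(shards), 1)


def weights_gb(n_params, bytes_per_param=2):
    """Rough weight footprint in GB; 2 bytes per parameter matches bfloat16.

    Assumption: 1 GB = 1e9 bytes, and activations or caches are ignored.
    """
    return n_params * bytes_per_param / 1e9


print(total_checkpoint_gb(SHARD_SIZES_GB))  # 83.6
print(weights_gb(40e9))                     # 80.0
```

The shard total matches the listed requirement. The nominal 40B estimate comes out lower, which is consistent with the true parameter count being somewhat above 40 billion, although this page does not state the exact count.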

Tags: Arxiv:1911.02150, Arxiv:2005.14165, Arxiv:2101.00027, Arxiv:2104.09864, Arxiv:2205.14135, Arxiv:2306.01116, Custom code, Dataset:tiiuae/falcon-refinedw..., De, Deploy:azure, En, Es, Falcon, Fr, Pytorch, Region:us, Safetensors, Sharded, Tensorflow

Falcon 40B Benchmarks

Scores shown as nn.n% indicate how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").

Falcon 40B Parameters and Internals

Model Type 
Causal decoder-only
Use Cases 
Areas:
Research on large language models
Applications:
Summarization, Text generation, Chatbot
Limitations:
The model has limited proficiency in languages other than English, German, Spanish, and French.
Considerations:
Before production use, finetune the model and study its stereotypes and biases.
Additional Notes 
A smaller model, Falcon-7B, is also available.
Supported Languages 
English (high), German (high), Spanish (high), French (high), Italian (limited), Portuguese (limited), Polish (limited), Dutch (limited), Romanian (limited), Czech (limited), Swedish (limited)
Training Details 
Data Sources:
RefinedWeb (tiiuae/falcon-refinedweb)
Data Volume:
1,000B tokens
Methodology:
Trained using FlashAttention and multiquery attention mechanisms
Context Length:
2048
Training Time:
two months
Hardware Used:
384 A100 40GB GPUs
Model Architecture:
Causal decoder-only model with FlashAttention, multiquery mechanism, and rotary position embeddings
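The training figures above (1,000B tokens, 384 GPUs, two months, context length 2048) imply an average throughput that is easy to work out. The sketch below is back-of-the-envelope arithmetic only: it assumes "two months" means 60 days of continuous training and that every training sequence fills the full 2048-token context, neither of which the page confirms. All names are illustrative.

```python
TOTAL_TOKENS = 1_000e9          # "1,000B tokens" from Data Volume
NUM_GPUS = 384                  # "384 A100 40GB GPUs"
CONTEXT_LENGTH = 2048           # tokens per sequence
TRAINING_DAYS = 60              # assumption: two months taken as 60 days

SECONDS = TRAINING_DAYS * 24 * 3600


def cluster_tokens_per_second(total_tokens=TOTAL_TOKENS, seconds=SECONDS):
    """Average tokens processed per second across the whole cluster."""
    return total_tokens / seconds


def per_gpu_tokens_per_second(num_gpus=NUM_GPUS):
    """Average tokens per second attributable to a single GPU."""
    return cluster_tokens_per_second() / num_gpus


def full_length_sequences(total_tokens=TOTAL_TOKENS, context=CONTEXT_LENGTH):
    """Number of sequences if every one used the full context window."""
    return int(total_tokens // context)


print(round(cluster_tokens_per_second()))   # 192901
print(round(per_gpu_tokens_per_second()))   # 502
print(full_length_sequences())              # 488281250
```

These are averages under the stated assumptions; real throughput varies with restarts, evaluation pauses, and sequence packing.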
Responsible AI Considerations
Fairness:
Model carries stereotypes and biases commonly encountered online
Mitigation Strategies:
Further finetuning for specific tasks
LLM Name: Falcon 40B
Repository: 🤗 https://huggingface.co/tiiuae/falcon-40b
Model Size: 40b
Required VRAM: 83.6 GB
Updated: 2026-05-11
Maintainer: tiiuae
Model Type: falcon
Model Files: 9.5 GB: 1-of-9, 9.5 GB: 2-of-9, 9.5 GB: 3-of-9, 9.5 GB: 4-of-9, 9.5 GB: 5-of-9, 9.5 GB: 6-of-9, 9.5 GB: 7-of-9, 9.5 GB: 8-of-9, 7.6 GB: 9-of-9
Supported Languages: en de es fr
Model Architecture: FalconForCausalLM
License: apache-2.0
Model Max Length: 2048
Transformers Version: 4.27.4
Is Biased: 0
Tokenizer Class: PreTrainedTokenizerFast
Vocabulary Size: 65024
Torch Data Type: bfloat16
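The fields above (repository, bfloat16 data type, maximum length of 2048) are enough to sketch how the model might be loaded with the Hugging Face transformers library. This is a minimal sketch, not a tested deployment recipe: it assumes a recent transformers release with built-in Falcon support (older releases such as the listed 4.27.4 would likely need `trust_remote_code=True`), the accelerate package for `device_map="auto"`, and enough GPU memory for roughly 83.6 GB of weights. The helper names `prompt_token_budget`, `load_falcon`, and `generate` are illustrative.

```python
MODEL_ID = "tiiuae/falcon-40b"   # repository listed above
MODEL_MAX_LENGTH = 2048          # "Model Max Length" listed above


def prompt_token_budget(max_new_tokens, max_length=MODEL_MAX_LENGTH):
    """Tokens left for the prompt once room for the output is reserved."""
    if not 0 < max_new_tokens < max_length:
        raise ValueError("max_new_tokens must be between 1 and max_length - 1")
    return max_length - max_new_tokens


def load_falcon(model_id=MODEL_ID):
    """Load tokenizer and model in bfloat16, spread across available devices.

    Imports are local so this module can be imported without the heavy
    dependencies installed; calling this downloads the full checkpoint.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # matches "Torch Data Type" above
        device_map="auto",           # assumption: accelerate is installed
    )
    return tokenizer, model


def generate(tokenizer, model, prompt, max_new_tokens=256):
    """Generate a completion, truncating the prompt to fit the 2048 window."""
    budget = prompt_token_budget(max_new_tokens)
    inputs = tokenizer(
        prompt, return_tensors="pt", truncation=True, max_length=budget
    ).to(model.device)
    output_ids = model.generate(
        input_ids=inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        max_new_tokens=max_new_tokens,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

With these helpers, `tokenizer, model = load_falcon()` followed by `generate(tokenizer, model, "Write a haiku about falcons.")` would be a typical call; reserving 256 output tokens leaves 1792 tokens for the prompt.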

Best Alternatives to Falcon 40B

Model                             Context / RAM    Downloads and Likes (digits run together)
... Falcon 40B Instruct 4 Bit Bnb 2K / 23.9 GB     70
Openbuddy Falcon 40B V16.1 4K     2K / 82.6 GB     1971
Tiny Random Falcon 40B            2K / 0 GB        1080
Falcon 40B Instruct               0K / 83.6 GB     173791177
ReluFalcon 40B                    0K / 167.1 GB    244
Tiny Random Falcon 40B            0K / 0.2 GB      15160
Falcon 40B Megacode2              0K / 82.5 GB     111
...lcon 40B Ft Alpaca Dolly Dutch 0K / 82.5 GB     154
...slessMegaCoder Falcon 40B Mini 0K / 82.5 GB     1342
Falcon 40B Megacode2 Oasst        0K / 82.5 GB     96
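The 4-bit entry in the list above shows 23.9 GB against 83.6 GB for the bfloat16 original. A naive estimate simply scales the 16-bit footprint by the ratio of bit widths, as sketched below. The function name is illustrative, and the estimate ignores quantization metadata and any layers kept in higher precision, which may plausibly explain why the listed figure is higher than the naive one.

```python
BF16_FOOTPRINT_GB = 83.6   # "Required VRAM" for the bfloat16 checkpoint


def naive_quantized_gb(full_gb, bits, full_bits=16):
    """Scale a 16-bit footprint to a lower bit width.

    Assumption: every weight is quantized and there is no overhead for
    scales, zero points, or unquantized layers.
    """
    return round(full_gb * bits / full_bits, 1)


print(naive_quantized_gb(BF16_FOOTPRINT_GB, bits=4))  # 20.9
print(naive_quantized_gb(BF16_FOOTPRINT_GB, bits=8))  # 41.8
```

The naive 4-bit value of 20.9 GB sits about 3 GB below the 23.9 GB listed for the 4-bit variant, so treat it as a lower bound rather than a sizing guarantee.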

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a