Galactica 120B by facebook

 ยป  All LLMs  ยป  facebook  ยป  Galactica 120B   URL Share it on

  Arxiv:1810.03993   Autotrain compatible   Galactica   Opt   Pytorch   Region:us   Sharded
Model Card on HF ๐Ÿค—: https://huggingface.co/facebook/galactica-120b 

Galactica 120B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Galactica 120B (facebook/galactica-120b)
๐ŸŒŸ Advertise your project ๐Ÿš€

Galactica 120B Parameters and Internals

Model Type 
Transformer
Use Cases 
Primary Use Cases:
scientific tasks such as citation prediction, scientific QA, mathematical reasoning, summarization, document generation, molecular property prediction and entity extraction
Limitations:
prone to hallucination, especially for lesser-known scientific concepts, exhibits popularity bias and certain biases despite lower toxicity
Considerations:
Researchers should be cautious of hallucinations and biases that could emerge.
Training Details 
Data Sources:
papers, textbooks, scientific websites, encyclopedias, reference material, knowledge bases
Data Volume:
106 billion tokens
Model Architecture:
Transformer based architecture in a decoder-only setup with a few modifications
Responsible Ai Considerations 
Mitigation Strategies:
See the paper for full information on the training data and model performance.
LLM NameGalactica 120B
Repository ๐Ÿค—https://huggingface.co/facebook/galactica-120b 
Model Size120b
Required VRAM244.2 GB
Updated2025-10-21
Maintainerfacebook
Model Typeopt
Model Files  9.5 GB: 1-of-26   9.9 GB: 2-of-26   9.9 GB: 3-of-26   9.9 GB: 4-of-26   9.9 GB: 5-of-26   9.2 GB: 6-of-26   9.2 GB: 7-of-26   9.9 GB: 8-of-26   9.9 GB: 9-of-26   9.9 GB: 10-of-26   9.9 GB: 11-of-26   9.2 GB: 12-of-26   9.2 GB: 13-of-26   9.9 GB: 14-of-26   9.9 GB: 15-of-26   9.9 GB: 16-of-26   9.9 GB: 17-of-26   9.2 GB: 18-of-26   9.2 GB: 19-of-26   9.9 GB: 20-of-26   9.9 GB: 21-of-26   9.9 GB: 22-of-26   9.9 GB: 23-of-26   9.2 GB: 24-of-26   9.2 GB: 25-of-26   2.7 GB: 26-of-26
Model ArchitectureOPTForCausalLM
Licensecc-by-nc-4.0
Context Length2048
Model Max Length2048
Transformers Version4.21.0.dev0
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size50000
Torch Data Typefloat16
Activation Functiongelu

Best Alternatives to Galactica 120B

Best Alternatives
Context / RAM
Downloads
Likes
Galactica 120B GPTQ 2 Bit 64g2K / 36.6 GB63
Note: green Score (e.g. "73.2") means that the model is better than facebook/galactica-120b.

Rank the Galactica 120B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51544 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124