Opt 350M by facebook

 ยป  All LLMs  ยป  facebook  ยป  Opt 350M   URL Share it on

  Arxiv:2005.14165   Arxiv:2205.01068   Autotrain compatible   En   Jax   Opt   Pytorch   Region:us   Tf
Model Card on HF ๐Ÿค—: https://huggingface.co/facebook/opt-350m 

Opt 350M Benchmarks

Opt 350M (facebook/opt-350m)
๐ŸŒŸ Advertise your project ๐Ÿš€

Opt 350M Parameters and Internals

Model Type 
text generation, decoder-only model
Use Cases 
Areas:
Research, Text Generation
Primary Use Cases:
prompting for evaluation, text generation
Limitations:
bias, toxicity, generation diversity issues, hallucination
Considerations:
Fine-tuned models will inherit biases from the base model.
Supported Languages 
English (Predominantly)
Training Details 
Data Sources:
BookCorpus, CC-Stories, The Pile including subsets like Pile-CC, OpenWebText2, USPTO, Project Gutenberg, OpenSubtitles, Wikipedia, DM Mathematics, HackerNews, Pushshift.io Reddit, CCNewsV2
Data Volume:
180B tokens
Methodology:
Pretrained using a causal language modeling (CLM) objective
Context Length:
2048
Training Time:
~33 days
Hardware Used:
992 *80GB A100 GPUs
Model Architecture:
Causal language modeling
Safety Evaluation 
Risk Categories:
bias, toxicity, safety issues
Ethical Considerations:
The training data contains unfiltered content, which is not neutral leading to biased outputs.
Responsible Ai Considerations 
Transparency:
Data sources and limitations mentioned, but specifics of transparency not detailed.
Input Output 
Accepted Modalities:
text
Output Format:
Generated text
LLM NameOpt 350M
Repository ๐Ÿค—https://huggingface.co/facebook/opt-350m 
Model Size350m
Required VRAM0.7 GB
Updated2025-07-31
Maintainerfacebook
Model Typeopt
Model Files  0.7 GB
Supported Languagesen
Model ArchitectureOPTForCausalLM
Licenseother
Context Length2048
Model Max Length2048
Transformers Version4.20.0.dev0
Beginning of Sentence Token</s>
End of Sentence Token</s>
Unk Token</s>
Vocabulary Size50272
Torch Data Typefloat16
Activation Functionrelu
Errorsreplace

Best Alternatives to Opt 350M

Best Alternatives
Context / RAM
Downloads
Likes
Opt Mini Dataset 02K / 0.7 GB70
Facebook Opt 350M SFT Korz142K / 0.7 GB50
Temp Model Sft2K / 1.3 GB60
Gpt350 Chat S V0 12K / 0.7 GB50
Gpt350 Chat S V02K / 0.7 GB50
Dadjokes Tuned Opt2K / 1.3 GB62
Pygmalion 350M2K / 1.3 GB236454
Aira OPT 350M2K / 0 GB2340
Remedycure2K / 1.3 GB120
Rockyalquimista8882K / 1.3 GB60
Note: green Score (e.g. "73.2") means that the model is better than facebook/opt-350m.

Rank the Opt 350M Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50262 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124