Bloom Finnish 176B is an open-source language model by TurkuNLP. Features: 176b LLM, VRAM: 4.9GB, License: bigscience-bloom-rail-1.0, LLM Explorer Score: 0.08.
Finnish (With capacity for Finnish), Others (Multilingual)
Training Details
Data Sources:
Finnish Internet Parsebank, mC4 multilingual colossal, cleaned Common Crawl, Common Crawl Finnish, Finnish Wikipedia, LΓΆnnrot Projekti, ePub National library, National library 'lehdet' collection, Suomi24 The Suomi 24 Corpus 2001-2020, Reddit r/Suomi submissions and comments, STT Finnish News Agency Archive 1992-2018, Yle Finnish News Archive 2011-2018, Yle Finnish News Archive 2019-2020, Yle News Archive Easy-to-read Finnish 2011-2018, Yle News Archive Easy-to-read Finnish 2019-2020, ROOTS
Data Volume:
40B tokens
Methodology:
Pretraining with ROOTS + Finnish dataset (without weighting)
Note: green Score (e.g. "73.2") means that the model is better than TurkuNLP/bloom-finnish-176b.
Rank the Bloom Finnish 176B Capabilities
π Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! π
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52473 in total.