GEITje 7B by Rijgersberg

 ยป  All LLMs  ยป  Rijgersberg  ยป  GEITje 7B   URL Share it on

  Autotrain compatible Base model:finetune:mistralai/... Base model:mistralai/mistral-7...   Conversational Dataset:rijgersberg/geitje-pre...   Endpoints compatible   Geitje   Generated from trainer   Mistral   Nl   Region:us   Safetensors   Tensorboard
Model Card on HF ๐Ÿค—: https://huggingface.co/Rijgersberg/GEITje-7B 

GEITje 7B Benchmarks

GEITje 7B (Rijgersberg/GEITje-7B)
๐ŸŒŸ Advertise your project ๐Ÿš€

GEITje 7B Parameters and Internals

Model Type 
language model, text generation
Additional Notes 
Further trained on Dutch language data to enhance Dutch language skills and knowledge.
Supported Languages 
nl (high)
Training Details 
Data Sources:
Dutch Gigacorpus, MADLAD-400
Data Volume:
10 billion tokens
Methodology:
full-parameter finetune
Context Length:
8192
Hardware Used:
8 GPUs
LLM NameGEITje 7B
Repository ๐Ÿค—https://huggingface.co/Rijgersberg/GEITje-7B 
Base Model(s)  mistralai/Mistral-7B-v0.1   mistralai/Mistral-7B-v0.1
Model Size7b
Required VRAM0 GB
Updated2025-09-08
MaintainerRijgersberg
Model Typemistral
Model Files  0.0 GB
Supported Languagesnl
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to GEITje 7B

Best Alternatives
Context / RAM
Downloads
Likes
...Nemo Instruct 2407 Abliterated1000K / 24.5 GB11218
MegaBeam Mistral 7B 512K512K / 14.4 GB913250
SpydazWeb AI HumanAI RP512K / 14.4 GB161
SpydazWeb AI HumanAI 002512K / 14.4 GB181
...daz Web AI ChatML 512K Project512K / 14.5 GB120
MegaBeam Mistral 7B 300K282K / 14.4 GB377916
MegaBeam Mistral 7B 300K282K / 14.4 GB814116
Hebrew Mistral 7B 200K256K / 30 GB129015
Astral 256K 7B V2250K / 14.4 GB50
Astral 256K 7B250K / 14.4 GB50
Note: green Score (e.g. "73.2") means that the model is better than Rijgersberg/GEITje-7B.

Rank the GEITje 7B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51187 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124