| Detail | Value |
|---|---|
| LLM Name | Bertin GPT J 6B |
| Repository 🤗 | https://huggingface.co/bertin-project/bertin-gpt-j-6B |
| Base Model(s) | |
| Model Size | 6b |
| Required VRAM | 24.2 GB |
| Updated | 2025-09-23 |
| Maintainer | bertin-project |
| Model Type | gptj |
| Model Files | |
| Supported Languages | es |
| Model Architecture | GPTJForCausalLM |
| License | apache-2.0 |
| Model Max Length | 2048 |
| Transformers Version | 4.10.0.dev0 |
| Tokenizer Class | GPT2Tokenizer |
| Vocabulary Size | 50400 |
| Torch Data Type | float32 |
| Activation Function | gelu_new |
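
For reference, a minimal sketch of loading this model with the Hugging Face `transformers` library, based on the details above (repository `bertin-project/bertin-gpt-j-6B`, `GPTJForCausalLM` architecture, `GPT2Tokenizer`, float32 weights, Spanish as the supported language). The `torch_dtype=torch.float16` option is an assumption, used only to roughly halve the ~24.2 GB float32 footprint listed in the table; it is not taken from this card.

```python
# Minimal sketch: loading bertin-gpt-j-6B with Hugging Face transformers.
# Assumes a recent transformers release; float16 loading is an assumption
# to reduce the ~24.2 GB float32 memory footprint listed above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bertin-project/bertin-gpt-j-6B"

tokenizer = AutoTokenizer.from_pretrained(model_id)  # GPT2Tokenizer per the card
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # assumption: card lists float32 weights
)

# Spanish prompt, since the card lists "es" as the supported language.
inputs = tokenizer("El sentido de la vida es", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```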
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| Mlperf GPT J 6B | 0K / 24.1 GB | 11595 | 0 |
| Pygmalion 6B | 0K / 16.3 GB | 2338 | 751 |
| Deception Normal | 0K / 12.2 GB | 6 | 0 |
| Deception Filteredpositive | 0K / 12.2 GB | 6 | 0 |
| Gptj Allenai Toxicity Blackbox | 0K / 12.2 GB | 9 | 0 |
| ...j Allenai Toxicity Explainable | 0K / 12.2 GB | 7 | 0 |
| Pygmalion 6B Roleplay | 0K / 12.1 GB | 1780 | 2 |
| GPT JT 6B V1 | 0K / 12.2 GB | 9662 | 302 |
| GPT J 6B | 0K / 24.2 GB | 43060 | 1509 |
| Test GPT J 6B | 0K / 2.5 GB | 10 | 0 |