Model Type |
| |||||||||||||||||||||||||||||||||||||
Use Cases |
| |||||||||||||||||||||||||||||||||||||
Supported Languages |
| |||||||||||||||||||||||||||||||||||||
Training Details |
| |||||||||||||||||||||||||||||||||||||
Input Output |
| |||||||||||||||||||||||||||||||||||||
Release Notes |
|
LLM Name | Bertin GPT J 6B |
Repository ๐ค | https://huggingface.co/bertin-project/bertin-gpt-j-6B |
Base Model(s) | |
Model Size | 6b |
Required VRAM | 24.2 GB |
Updated | 2025-09-07 |
Maintainer | bertin-project |
Model Type | gptj |
Model Files | |
Supported Languages | es |
Model Architecture | GPTJForCausalLM |
License | apache-2.0 |
Model Max Length | 2048 |
Transformers Version | 4.10.0.dev0 |
Tokenizer Class | GPT2Tokenizer |
Vocabulary Size | 50400 |
Torch Data Type | float32 |
Activation Function | gelu_new |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Mlperf GPT J 6B | 0K / 24.1 GB | 11595 | 0 |
Deception Normal | 0K / 12.2 GB | 5 | 0 |
Deception Filteredpositive | 0K / 12.2 GB | 5 | 0 |
Pygmalion 6B | 0K / 16.3 GB | 2435 | 751 |
Gptj Allenai Toxicity Blackbox | 0K / 12.2 GB | 5 | 0 |
...j Allenai Toxicity Explainable | 0K / 12.2 GB | 5 | 0 |
Test GPT J 6B | 0K / 2.5 GB | 7 | 0 |
Pygmalion 6B Roleplay | 0K / 12.1 GB | 1731 | 2 |
Gpt4all J | 0K / 12.2 GB | 3474 | 299 |
GPT JT 6B V1 | 0K / 12.2 GB | 10389 | 302 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐