GPT J 6B Ptmap by baffo32


arXiv:2101.00027, arXiv:2104.09864, Autotrain compatible, En, Endpoints compatible, Gptj, Pytorch, Region:us
Model Card on HF: https://huggingface.co/baffo32/gpt-j-6B-ptmap


GPT J 6B Ptmap Parameters and Internals

Model Type 
transformer, autoregressive, language model
Use Cases 
Areas:
research, commercial applications
Primary Use Cases:
text generation from a prompt
Limitations:
May produce socially unacceptable text; not dependable for factually accurate output
Considerations:
Human review is recommended to filter outputs for quality and appropriateness.
Supported Languages 
English (full)
Training Details 
Data Sources:
The Pile, EleutherAI
Data Volume:
402 billion tokens
Methodology:
Autoregressive language model, using cross-entropy loss
Context Length:
2048
Hardware Used:
TPU v3-256 pod
Model Architecture:
28 layers, 4096 model dimension, 16384 feedforward dimension, 16 heads with 256 dimension each, Rotary Position Embedding (RoPE) applied to 64 dimensions
Safety Evaluation 
Ethical Considerations:
GPT-J was trained on the Pile, a dataset known to contain profane, lewd, and otherwise abrasive language.
Input Output 
Accepted Modalities:
text
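The architecture entry above says RoPE is applied to 64 of the 256 dimensions in each head. A minimal sketch of what such a partial rotation could look like, in plain Python: the frequency base of 10000 and the pairing of adjacent dimensions are assumptions made for illustration, not details stated on this page, and `apply_partial_rope` and `query` are hypothetical names.

```python
import math

HEAD_DIM = 256    # per-head dimension, from the architecture entry
ROTARY_DIM = 64   # leading dimensions that receive the rotation
BASE = 10000.0    # assumed frequency base (a common RoPE default)


def apply_partial_rope(vec, position, rotary_dim=ROTARY_DIM, base=BASE):
    """Rotate the first `rotary_dim` entries of `vec` pairwise; copy the rest."""
    out = list(vec)
    for i in range(rotary_dim // 2):
        # Assumption: adjacent pairs (2i, 2i+1) share one rotation frequency.
        theta = position * base ** (-2.0 * i / rotary_dim)
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        a, b = vec[2 * i], vec[2 * i + 1]
        out[2 * i] = a * cos_t - b * sin_t
        out[2 * i + 1] = a * sin_t + b * cos_t
    return out


def dot(u, v):
    """Plain dot product, used to compare attention scores."""
    return sum(x * y for x, y in zip(u, v))


query = [0.01 * k for k in range(HEAD_DIM)]  # hypothetical query vector
rotated = apply_partial_rope(query, position=5)
```

Under these assumptions the last 192 entries of `rotated` are unchanged, and the dot product between a rotated query and a rotated key depends only on the distance between their positions.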
LLM Name: GPT J 6B Ptmap
Repository: https://huggingface.co/baffo32/gpt-j-6B-ptmap
Model Size: 6B
Required VRAM: 24.2 GB
Updated: 2025-09-23
Maintainer: baffo32
Model Type: gptj
Model Files: 24.2 GB
Supported Languages: en
Model Architecture: GPTJForCausalLM
License: apache-2.0
Model Max Length: 1024
Transformers Version: 4.10.0.dev0
Tokenizer Class: GPT2Tokenizer
Beginning of Sentence Token: <|endoftext|>
End of Sentence Token: <|endoftext|>
Unk Token: <|endoftext|>
Vocabulary Size: 50400
Activation Function: gelu_new
Errors: replace
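The listed sizes can be cross-checked with a rough weight count. The sketch below uses the layer count, model dimension, feedforward dimension, and vocabulary size given on this page. It assumes separate input and output embedding matrices, ignores biases and layer norms, and assumes 4 bytes per parameter (32-bit floats) with 1 GB = 10^9 bytes. None of these assumptions is stated on the page, and `estimate_params` is a hypothetical helper.

```python
N_LAYERS = 28        # from the architecture entry
D_MODEL = 4096       # model dimension
D_FF = 16384         # feedforward dimension
VOCAB = 50400        # vocabulary size listed above
BYTES_PER_PARAM = 4  # assumption: weights stored as 32-bit floats


def estimate_params(n_layers=N_LAYERS, d_model=D_MODEL, d_ff=D_FF, vocab=VOCAB):
    """Approximate weight count, ignoring biases and layer norms."""
    embeddings = 2 * vocab * d_model   # assumes untied input/output embeddings
    attention = 4 * d_model * d_model  # query, key, value, output projections
    feedforward = 2 * d_model * d_ff   # up and down projections
    return embeddings + n_layers * (attention + feedforward)


params = estimate_params()
size_gb = params * BYTES_PER_PARAM / 1e9
print(f"{params / 1e9:.2f}B parameters, about {size_gb:.1f} GB at 4 bytes each")
# prints: 6.05B parameters, about 24.2 GB at 4 bytes each
```

The estimate matches the 24.2 GB figure listed for the model files, which suggests, but does not confirm, that the weights are stored in 32-bit precision.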

Best Alternatives to GPT J 6B Ptmap

Best Alternatives | Context / RAM | Downloads and Likes
Mlperf GPT J 6B | 0K / 24.1 GB | 115950
Deception Normal | 0K / 12.2 GB | 60
Deception Filteredpositive | 0K / 12.2 GB | 60
Pygmalion 6B | 0K / 16.3 GB | 2338751
Gptj Allenai Toxicity Blackbox | 0K / 12.2 GB | 90
...j Allenai Toxicity Explainable | 0K / 12.2 GB | 70
Pygmalion 6B Roleplay | 0K / 12.1 GB | 17802
Gpt4all J | 0K / 12.2 GB | 3840299
Test GPT J 6B | 0K / 2.5 GB | 100
GPT JT 6B V1 | 0K / 12.2 GB | 9662302


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124