ChanMalion by TehVenom

 ยป  All LLMs  ยป  TehVenom  ยป  ChanMalion   URL Share it on

  Autotrain compatible   Endpoints compatible   Gptj   Pytorch   Region:us   Sharded
Model Card on HF ๐Ÿค—: https://huggingface.co/TehVenom/ChanMalion 

ChanMalion Benchmarks

ChanMalion (TehVenom/ChanMalion)
๐ŸŒŸ Advertise your project ๐Ÿš€

ChanMalion Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
community-driven projects, experimentation
Limitations:
Potential bias due to training data, Not suitable for all conversational contexts
Considerations:
Caution advised in use due to mixed training data origins
Additional Notes 
Given its mixed training corpus, the model exhibits unique behavior influenced by the distinct datasets. The merging of 4Chan and Pygmalion-6b provides a wide range of conversational abilities, but biases from both datasets must be considered.
Training Details 
Data Sources:
4Chan posts, Pygmalion dataset
Methodology:
Intermediate merging of GPT-J trained on 4Chan data with Pygmalion-6b model
Model Architecture:
Transformer-based architecture
LLM NameChanMalion
Repository ๐Ÿค—https://huggingface.co/TehVenom/ChanMalion 
Required VRAM12.3 GB
Updated2025-08-17
MaintainerTehVenom
Model Typegptj
Model Files  2.1 GB: 1-of-6   2.1 GB: 2-of-6   2.1 GB: 3-of-6   2.1 GB: 4-of-6   2.0 GB: 5-of-6   1.9 GB: 6-of-6
Model ArchitectureGPTJForCausalLM
Model Max Length1024
Transformers Version4.25.0.dev0
Tokenizer ClassGPT2Tokenizer
Beginning of Sentence Token<|endoftext|>
End of Sentence Token<|endoftext|>
Unk Token<|endoftext|>
Vocabulary Size50400
Torch Data Typefloat16
Activation Functiongelu_new
Errorsreplace

Best Alternatives to ChanMalion

Best Alternatives
Context / RAM
Downloads
Likes
DiffMerge DollyGPT Pygmalion0K / 12.1 GB19272
...Merge Pygmalion Main Onto V8P40K / 12.3 GB19271
Javalion GPTJ0K / 12.3 GB17931
Skegma GPTJ0K / 12.3 GB17770
Javalion R0K / 12.3 GB17915
Adventien GPTJ0K / 12.3 GB17950
Janin GPTJ0K / 12.3 GB17950
Javelin R0K / 12.3 GB17812
Javelin GPTJ0K / 12.3 GB17854
Janin R0K / 12.3 GB17751
Note: green Score (e.g. "73.2") means that the model is better than TehVenom/ChanMalion.

Rank the ChanMalion Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50723 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124