Additional Notes |
| |||
Training Details |
|
LLM Name | Custom Activations GPT KAN |
Repository ๐ค | https://huggingface.co/AISE-TUDelft/Custom-Activations-GPT-KAN |
Required VRAM | 0 GB |
Updated | 2025-09-22 |
Maintainer | AISE-TUDelft |
Model Type | activations_gpt_neo |
Model Files | |
Model Architecture | ActivationsGPTNeoForCausalLM |
Context Length | 512 |
Model Max Length | 512 |
Transformers Version | 4.26.1 |
Vocabulary Size | 10000 |
Torch Data Type | float32 |
Activation Function | gelu_new |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐