LLM Name | Checkpoints Multiple Datasets Layer 1 Decoder Fixed |
Repository ๐ค | https://huggingface.co/thejaminator/checkpoints_multiple_datasets_layer_1_decoder-fixed |
Base Model(s) | |
Model Size | 8b |
Required VRAM | 0.7 GB |
Updated | 2025-09-16 |
Maintainer | thejaminator |
Model Files | |
Model Architecture | AutoModelForCausalLM |
Model Max Length | 131072 |
Is Biased | none |
Tokenizer Class | Qwen2Tokenizer |
Padding Token | <|endoftext|> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | up_proj|o_proj|v_proj|q_proj|gate_proj|down_proj|k_proj |
LoRA Alpha | 128 |
LoRA Dropout | 0.05 |
R Param | 64 |
Errors | replace |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Llama 3.1 8B Instuct Uz | 128K / 16.1 GB | 1000 | 15 |
Trillama 8B | 8K / 16.1 GB | 4 | 3 |
AutogenJune2000 | 8K / 33.8 GB | 5 | 0 |
Llama3 8B | 8K / 16.1 GB | 5 | 0 |
Medllama3 V20 | 0K / 16.1 GB | 25328 | 78 |
CryptoAI | 0K / 16.1 GB | 86 | 1 |
...llem V1 Smilebase Llama 3.1 8B | 0K / 1.1 GB | 13 | 0 |
Llama3 2 8B To 1B Test | 0K / 2.5 GB | 7 | 0 |
Autotrain Pvqlj Odah2 | 0K / 0.2 GB | 18 | 0 |
500tiao 100lun | 0K / 0.2 GB | 5 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐