LLM Name | G3 12B Pt Story Qlora |
Repository ๐ค | https://huggingface.co/ToastyPigeon/g3-12b-pt-story-qlora |
Base Model(s) | |
Model Size | 12b |
Required VRAM | 0.5 GB |
Updated | 2025-04-22 |
Maintainer | ToastyPigeon |
Model Files | |
Model Architecture | Adapter |
License | gemma |
Model Max Length | 131072 |
Is Biased | none |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | down_proj|q_proj|gate_proj|o_proj|k_proj|up_proj|v_proj |
LoRA Alpha | 64 |
LoRA Dropout | 0.5 |
R Param | 64 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Noodles 12B | 0K / 3.6 GB | 11 | 0 |
Beef 12B | 0K / 3.6 GB | 5 | 0 |
CEV Other 12B Lora | 0K / 0.1 GB | 5 | 0 |
Meme 12B Lora E2 | 0K / 0.2 GB | 8 | 0 |
Meme 12B Lora E2 | 0K / 0.2 GB | 5 | 0 |
Control 12B R3 Lora | 0K / 2.9 GB | 6 | 0 |
...Noctis R64 Test Train Lora 12B | 0K / 0.9 GB | 7 | 0 |
...Noctis R64 Test Train Lora 12B | 0K / 0.9 GB | 5 | 0 |
Aurora 12B | 0K / 24.5 GB | 8 | 0 |
Aura NeMo 12B | 0K / 1.8 GB | 6 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐