| LLM Name | DPO Llama3 8B Grammar Rules |
| Repository ๐ค | https://huggingface.co/hannahbillo/dpo-llama3-8b-grammar-rules |
| Base Model(s) | |
| Model Size | 8b |
| Required VRAM | 0.1 GB |
| Updated | 2025-09-23 |
| Maintainer | hannahbillo |
| Model Files | |
| Model Architecture | Adapter |
| License | llama3.1 |
| Model Max Length | 131072 |
| Is Biased | none |
| Tokenizer Class | PreTrainedTokenizerFast |
| Padding Token | <|end_of_text|> |
| PEFT Type | LORA |
| LoRA Model | Yes |
| PEFT Target Modules | q_proj|up_proj|v_proj|o_proj|k_proj|gate_proj|down_proj |
| LoRA Alpha | 32 |
| LoRA Dropout | 0.05 |
| R Param | 6 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| ... 3 8B Instruct Bvr Finetune V3 | 8K / 16.1 GB | 7 | 0 |
| Flippa V6 | 0K / 0 GB | 1653 | 1 |
| ...B Instruct DPO 0R100L PoliTune | 0K / 16.1 GB | 6 | 0 |
| Llama 3 Korean 8B R V 0.1 | 0K / 0 GB | 20 | 0 |
| ...B Lora Rag Citation Generation | 0K / 0 GB | 11 | 3 |
| Suavemente 8B R128 LORA | 0K / 2.8 GB | 9 | 0 |
| ...a 8B Rank 128 INSTRUCT Adapter | 0K / 0.7 GB | 18 | 1 |
| ...ning Llama 8B Rank 128 Adapter | 0K / 0.7 GB | 3 | 1 |
| L3 Umbral Mind R128 LoRA | 0K / 0.7 GB | 4 | 3 |
| ...a7 4262 4abb 97b1 1879f340d32e | 0K / 0.3 GB | 5 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐