LLM Name | Pythia410m DPO Tldr |
Repository ๐ค | https://huggingface.co/mnoukhov/pythia410m-dpo-tldr |
Base Model(s) | |
Required VRAM | 0 GB |
Updated | 2025-07-07 |
Maintainer | mnoukhov |
Model Files | |
Model Architecture | Adapter |
License | apache-2.0 |
Is Biased | none |
Tokenizer Class | GPTNeoXTokenizer |
Padding Token | <|padding|> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | dense|dense_h_to_4h|dense_4h_to_h|query_key_value |
LoRA Alpha | 32 |
LoRA Dropout | 0.05 |
R Param | 16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Nemo Kimi Lora | 0K / 1.8 GB | 15 | 0 |
Nemo Books Lora 4 | 0K / 1.8 GB | 7 | 0 |
Nemo Books Lora | 0K / 1.8 GB | 9 | 0 |
Phi 3 Mini 4K Instruct Sa V0.1 | 0K / 0 GB | 5 | 0 |
Francois KTO Lora | 0K / 0 GB | 11 | 0 |
Francois KTO Lora | 0K / 0 GB | 6 | 0 |
Rei V2 Kto | 0K / 0 GB | 13 | 0 |
...caaaf043da230d9a30d8e0ddcbe879 | 0K / 0.4 GB | 11 | 0 |
...357cade9cc1096cecc35c34dba8992 | 0K / 1.3 GB | 10 | 0 |
Rei V2 Kto | 0K / 0 GB | 5 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐