Pythia410m DPO Tldr By mnoukhov: Benchmarks, Features and Detailed Analysis. Insights on Pythia410m DPO Tldr.

Adapter Base model:adapter:mnoukhov/py... Base model:mnoukhov/pythia410m... Finetuned Generated from trainer Lora Peft Region:us Safetensors Tensorboard

Model Card on HF 🤗: https://huggingface.co/mnoukhov/pythia410m-dpo-tldr

Pythia410m DPO Tldr Benchmarks

LLME Score: 0.15422

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Pythia410m DPO Tldr (mnoukhov/pythia410m-dpo-tldr)

🌟 Advertise your project 🚀

Pythia410m DPO Tldr Parameters and Internals

LLM Name	Pythia410m DPO Tldr
Repository 🤗	https://huggingface.co/mnoukhov/pythia410m-dpo-tldr
Base Model(s)	Pythia410m Sft Tldr mnoukhov/pythia410m-sft-tldr
Required VRAM	0 GB
Updated	2025-07-07
Maintainer	mnoukhov
Model Files	0.0 GB 0.0 GB
Model Architecture	Adapter
License	apache-2.0
Is Biased	none
Tokenizer Class	GPTNeoXTokenizer
Padding Token	<\|padding\|>
PEFT Type	LORA
LoRA Model	Yes
PEFT Target Modules	dense\|dense_h_to_4h\|dense_4h_to_h\|query_key_value
LoRA Alpha	32
LoRA Dropout	0.05
R Param	16

Best Alternatives to Pythia410m DPO Tldr

Best Alternatives	Context / RAM	Downloads
Nemo Kimi Lora	0K / 1.8 GB	15
Nemo Books Lora 4	0K / 1.8 GB	7
Nemo Books Lora	0K / 1.8 GB	9
Phi 3 Mini 4K Instruct Sa V0.1	0K / 0 GB	5
Francois KTO Lora	0K / 0 GB	11
Francois KTO Lora	0K / 0 GB	6
Rei V2 Kto	0K / 0 GB	13
...caaaf043da230d9a30d8e0ddcbe879	0K / 0.4 GB	11
...357cade9cc1096cecc35c34dba8992	0K / 1.3 GB	10
Rei V2 Kto	0K / 0 GB	5

Note: green Score (e.g. "73.2") means that the model is better than mnoukhov/pythia410m-dpo-tldr.

Rank the Pythia410m DPO Tldr Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 51483 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241124

Support LLM Explorer

Pythia410m DPO Tldr by mnoukhov

» All LLMs » mnoukhov » Pythia410m DPO Tldr URL Share it on

Pythia410m DPO Tldr Benchmarks

Pythia410m DPO Tldr Parameters and Internals

Best Alternatives to Pythia410m DPO Tldr

Rank the Pythia410m DPO Tldr Capabilities

What open-source LLMs or SLMs are you in search of? 51483 in total.