Qwen2.5 1.5B It Rft Sft Wildchat Cw 3K Neg Rethink Pos by kevinshin


Tags: Alignment-handbook, Autotrain compatible, Base model:finetune:kevinshin/..., Base model:kevinshin/qwen2.5-1..., Conversational, Dataset:kevinshin/wildchat-cre..., Endpoints compatible, Generated from trainer, Qwen2, Region:us, Safetensors, Sft, Sharded, Tensorflow, Trl

Qwen2.5 1.5B It Rft Sft Wildchat Cw 3K Neg Rethink Pos Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Qwen2.5 1.5B It Rft Sft Wildchat Cw 3K Neg Rethink Pos (kevinshin/qwen2.5-1.5b-it-rft-sft-wildchat-cw-3k-neg-rethink-pos)

Qwen2.5 1.5B It Rft Sft Wildchat Cw 3K Neg Rethink Pos Parameters and Internals

LLM Name: Qwen2.5 1.5B It Rft Sft Wildchat Cw 3K Neg Rethink Pos
Repository: https://huggingface.co/kevinshin/qwen2.5-1.5b-it-rft-sft-wildchat-cw-3k-neg-rethink-pos
Model Name: qwen2.5-1.5b-it-rft-sft-wildchat-cw-3k-neg-rethink-pos
Base Model(s): kevinshin/qwen2.5-1.5b-it-think-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k
Model Size: 1.5b
Required VRAM: 6.2 GB
Updated: 2025-09-11
Maintainer: kevinshin
Model Type: qwen2
Model Files: 5.0 GB (1-of-2), 1.2 GB (2-of-2), 0.0 GB
Model Architecture: Qwen2ForCausalLM
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.55.0.dev0
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 151936
Torch Data Type: float32
Errors: replace
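
The Required VRAM figure above matches the sum of the listed model file sizes, and the weights are stored as float32. The listing does not say how the figure is derived, so the following is only a rough sketch under that assumption: it adds up the shard sizes and estimates the weight footprint if the same weights were loaded in a 2-byte type. The function names and the dtype table are illustrative, not part of the listing, and the estimate covers weights only (no activations or KV cache).

```python
# Shard sizes in GB, copied from the "Model Files" row of the listing.
SHARD_SIZES_GB = [5.0, 1.2, 0.0]

# Bytes per parameter for common dtypes (assumption: weights dominate memory use).
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def total_weight_gb(shards):
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(shards), 1)


def estimated_gb_for_dtype(total_gb, stored_dtype, target_dtype):
    """Scale the stored size by the ratio of bytes per parameter."""
    scale = BYTES_PER_PARAM[target_dtype] / BYTES_PER_PARAM[stored_dtype]
    return round(total_gb * scale, 1)


total = total_weight_gb(SHARD_SIZES_GB)
half = estimated_gb_for_dtype(total, "float32", "bfloat16")

print(f"float32 weights: {total} GB")
print(f"bfloat16 estimate: {half} GB")
```

Under these assumptions the float32 total reproduces the 6.2 GB listed as Required VRAM, and the half-precision estimate comes out at about half of that.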

Best Alternatives to Qwen2.5 1.5B It Rft Sft Wildchat Cw 3K Neg Rethink Pos

Best Alternatives | Context / RAM | Downloads, Likes
ReaderLM V2 | 500K / 3.1 GB | 22362700
Reader Lm 1.5B | 250K / 3.1 GB | 1565605
DeepSeek R1 Distill Qwen 1.5B | 128K / 3.5 GB | 5828151328
...n Research Reasoning Qwen 1.5B | 128K / 7.1 GB | 171436216
DeepScaleR 1.5B Preview | 128K / 7.1 GB | 30523572
AceInstruct 1.5B | 128K / 3.5 GB | 6307620
Qwen2.5 1.5B | 128K / 3.1 GB | 333474123
OpenReasoning Nemotron 1.5B | 128K / 3.1 GB | 274743
...1 Distill Qwen 1.5B GSPO Basic | 128K / 3.5 GB | 13370
Qwen2 1.5B | 128K / 3.1 GB | 7419297
Note: green Score (e.g. "73.2") means that the model is better than kevinshin/qwen2.5-1.5b-it-rft-sft-wildchat-cw-3k-neg-rethink-pos.


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124