LLM Name | 7b DPO Iter1 4e7 Step200 Fromsftepoch2 |
Repository ๐ค | https://huggingface.co/1231czx/7b_dpo_iter1_4e7_step200_fromsftepoch2 |
Model Size | 7b |
Required VRAM | 17.1 GB |
Updated | 2025-09-18 |
Maintainer | 1231czx |
Model Type | gemma |
Model Files | |
Model Architecture | GemmaForCausalLM |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.41.1 |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Kaggle Math Model Gemma V1 | 12K / 17.1 GB | 5 | 0 |
Gemma 1.1 7B It | 8K / 17.1 GB | 13535 | 275 |
SeaLLM 7B V2.5 | 8K / 17.1 GB | 13066 | 50 |
Zephyr 7B Gemma DPO Avg | 8K / 17.1 GB | 15 | 0 |
Zephyr 7B Gemma Rpo Avg | 8K / 17.1 GB | 6 | 0 |
Codegemma 7B | 8K / 17.1 GB | 52677 | 201 |
Zephyr 7B Gemma V0.1 | 8K / 17.1 GB | 1180 | 123 |
... Codegemma 2 7B It Alpaca V1.3 | 8K / 17.1 GB | 3 | 1 |
... 7B Finetuned Sft Navarasa 2.0 | 8K / 34 GB | 1218 | 22 |
Codegemma 7B It | 8K / 17.1 GB | 5133 | 231 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐