| LLM Name | Diff Starcoder 7B Rl |
| Repository | https://huggingface.co/vdaita/diff-starcoder-7b-rl |
| Model Size | 7b |
| Required VRAM | 0.1 GB |
| Updated | 2025-02-14 |
| Maintainer | vdaita |
| Model Architecture | AutoModel |
| License | apache-2.0 |
| Is Biased | none |
| Tokenizer Class | GPT2Tokenizer |
| Padding Token | `<\|endoftext\|>` |
| Vocabulary Size | 49152 |
| PEFT Type | LORA |
| LoRA Model | Yes |
| PEFT Target Modules | k_proj, o_proj, q_proj, down_proj, v_proj, gate_proj, up_proj |
| LoRA Alpha | 32 |
| LoRA Dropout | 0.05 |
| R Param | 16 |
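
The LoRA rows above (R Param, LoRA Alpha, LoRA Dropout, PEFT Target Modules) can be read together as one adapter configuration. The following is a minimal sketch in plain Python, not the maintainer's training code: the `LoraSettings` class and its `added_params` helper are hypothetical names introduced for illustration, the scaling factor assumes the standard LoRA convention of `alpha / r` (not a rank-stabilized variant), and the 4096 x 4096 layer shape in the usage line is an assumed example, not a figure from this listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Hypothetical container mirroring the LoRA rows in the table above."""
    r: int
    alpha: int
    dropout: float
    target_modules: tuple

    @property
    def scaling(self) -> float:
        # Standard LoRA multiplies the low-rank update B @ A by alpha / r.
        return self.alpha / self.r

    def added_params(self, d_in: int, d_out: int) -> int:
        # One adapted linear layer gains A (r x d_in) and B (d_out x r).
        return self.r * (d_in + d_out)


# Values copied from the listing; nothing here is measured or downloaded.
settings = LoraSettings(
    r=16,
    alpha=32,
    dropout=0.05,
    target_modules=(
        "k_proj", "o_proj", "q_proj", "down_proj",
        "v_proj", "gate_proj", "up_proj",
    ),
)

# Assumed example shape: a square 4096 x 4096 projection.
example_extra = settings.added_params(4096, 4096)
```

With these values the update is scaled by 32 / 16 = 2.0, and each adapted matrix adds only `r * (d_in + d_out)` trainable weights, which is why a LoRA adapter file is small compared with the full 7B base model.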
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| TroL 7B | 32K / 17.3 GB | 9 | 7 |
| MoAI 7B | 32K / 17.7 GB | 388 | 45 |
| CoLLaVO 7B | 32K / 18.6 GB | 14 | 21 |
| ... 7b 448 Qinstruct Preview V0.1 | 2K / 17.3 GB | 24 | 4 |
| Janus Pro 7B | 0K / 14.8 GB | 192409 | 3503 |
| Autotrain Z7uyk Cwqtz | 0K / 0.2 GB | 7 | 0 |
| Qwen 2.5 7B 1M RRP V1 Lora | 0K / 0.2 GB | 0 | 3 |
| ...2.5 7B Instruct Abliterated V3 | 0K / 0.2 GB | 0 | 1 |
| Medical Mixtral 7B V2k | 0K / 0.4 GB | 8 | 0 |
| Silicon Natsuki 7B | 0K / 14.4 GB | 6 | 1 |