Qwen 4B Thinking Stage3 Grpo Lora is an open-source language model by Daniel031203. Features: 4b LLM, LLM Explorer Score: 0.27.
| LLM Name | Qwen 4B Thinking Stage3 Grpo Lora |
| Repository ๐ค | https://huggingface.co/Daniel031203/qwen-4b-thinking-stage3-grpo-lora |
| Base Model(s) | |
| Model Size | 4b |
| Required VRAM | 0 GB |
| Updated | 2026-06-01 |
| Maintainer | Daniel031203 |
| Model Files | |
| Model Architecture | AutoModel |
| Model Max Length | 262144 |
| Is Biased | none |
| Tokenizer Class | Qwen2Tokenizer |
| Padding Token | <|PAD_TOKEN|> |
| PEFT Type | LORA |
| LoRA Model | Yes |
| PEFT Target Modules | k_proj|up_proj|gate_proj|out_proj|o_proj|q_proj|down_proj|v_proj |
| LoRA Alpha | 128 |
| LoRA Dropout | 0 |
| R Param | 64 |
| Errors | replace |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| ... 3n 4B It Distill Smollm2 360M | 0K / 0 GB | 55 | 0 |
| ...istill Haiku Sftv4 Nofilter V2 | 0K / 0.5 GB | 21 | 0 |
| Qwen3 4B Chunky | 0K / 0.3 GB | 19 | 0 |
| Translategemma Tok | 0K / 0.2 GB | 8 | 0 |
| Gemma3 Konkani | 0K / 0 GB | 119 | 5 |
| Gemma3 Konkani 4B | 0K / 0 GB | 119 | 5 |
| AYA Mistral7B Instruct TR 4B | 0K / 0.3 GB | 0 | 6 |
| ...istill Haiku Sftv4 Nofilter V1 | 0K / 0.5 GB | 15 | 0 |
| II Search 4B GGUF | 0K / 1.7 GB | 790 | 5 |
| ...upyter Agent Qwen3 4B AIO GGUF | 0K / 1.7 GB | 328 | 4 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐