Who developed Qwen 4B Thinking Stage3 Grpo Lora and how large is it?

Qwen 4B Thinking Stage3 Grpo Lora is developed by Daniel031203, a model with 4b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Qwen 4B Thinking Stage3 Grpo Lora?

Qwen 4B Thinking Stage3 Grpo Lora is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Qwen 4B Thinking Stage3 Grpo Lora by Daniel031203

Name: Qwen 4B Thinking Stage3 Grpo Lora
Author: Daniel031203

Qwen 4B Thinking Stage3 Grpo Lora is an open-source language model by Daniel031203. Features: 4b LLM, LLM Explorer Score: 0.25.

Arxiv:1910.09700 Base model:adapter:daniel03120... Base model:daniel031203/qwen-4... Conversational Grpo Lora Peft Region:us Safetensors Trl Unsloth

Model Card on HF 🤗: https://huggingface.co/Daniel031203/qwen-4b-thinking-stage3-grpo-lora

Qwen 4B Thinking Stage3 Grpo Lora Benchmarks

LLME Score: 0.24603

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen 4B Thinking Stage3 Grpo Lora (Daniel031203/qwen-4b-thinking-stage3-grpo-lora)

🌟 Advertise your project 🚀

Qwen 4B Thinking Stage3 Grpo Lora Parameters and Internals

LLM Name	Qwen 4B Thinking Stage3 Grpo Lora
Repository 🤗	https://huggingface.co/Daniel031203/qwen-4b-thinking-stage3-grpo-lora
Base Model(s)	Qwen 4B Thinking Stage2 Merged Daniel031203/qwen-4b-thinking-stage2-merged
Model Size	4b
Required VRAM	0 GB
Updated	2026-06-17
Maintainer	Daniel031203
Model Files	0.5 GB 0.3 GB 0.0 GB 0.0 GB
Model Architecture	AutoModel
Model Max Length	262144
Is Biased	none
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|PAD_TOKEN\|>
PEFT Type	LORA
LoRA Model	Yes
PEFT Target Modules	k_proj\|up_proj\|gate_proj\|out_proj\|o_proj\|q_proj\|down_proj\|v_proj
LoRA Alpha	128
LoRA Dropout	0
R Param	64
Errors	replace

Best Alternatives to Qwen 4B Thinking Stage3 Grpo Lora

Best Alternatives	Context / RAM	Downloads	Likes
... 3n 4B It Distill Smollm2 360M	0K / 0 GB	55	0
...istill Haiku Sftv4 Nofilter V2	0K / 0.5 GB	5	0
Qwen3 4B Chunky	0K / 0.3 GB	7	0
Translategemma Tok	0K / 0.2 GB	5	0
Gemma3 Konkani	0K / 0 GB	119	5
Gemma3 Konkani 4B	0K / 0 GB	119	5
AYA Mistral7B Instruct TR 4B	0K / 0.3 GB	0	6
...istill Haiku Sftv4 Nofilter V1	0K / 0.5 GB	30	0
II Search 4B GGUF	0K / 1.7 GB	790	5
...upyter Agent Qwen3 4B AIO GGUF	0K / 1.7 GB	265	4

Note: green Score (e.g. "73.2") means that the model is better than Daniel031203/qwen-4b-thinking-stage3-grpo-lora.

Rank the Qwen 4B Thinking Stage3 Grpo Lora Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 55171 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer