What are the hardware requirements for Qwen3 4B 3 Sfted Grpo H200 1500step?

Qwen3 4B 3 Sfted Grpo H200 1500step requires approximately 8.1 GB of VRAM to hold its weights and supports a context window of 40K tokens (40,960). Quantized variants may run on less VRAM; see the Quantized Models section on this page.
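As a rough illustration of what the 40,960-token context window means in practice, the sketch below computes how many tokens remain for generation once a prompt has been tokenized. The function name `max_new_tokens_allowed` is hypothetical, and the assumption that prompt and generated tokens share a single budget is a common convention rather than something this page states.

```python
# Context length as listed on this page (40K = 40 * 1024 tokens).
CONTEXT_LENGTH = 40960


def max_new_tokens_allowed(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens can still be generated after the prompt.

    Assumes (hypothetically) that prompt and output tokens share one budget.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # Clamp at zero: a prompt longer than the window leaves no room to generate.
    return max(context_length - prompt_tokens, 0)


if __name__ == "__main__":
    print(max_new_tokens_allowed(40000))
```

A prompt that already exceeds the window yields zero here; in a real pipeline it would need to be truncated or split before being sent to the model.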

Who developed Qwen3 4B 3 Sfted Grpo H200 1500step and how large is it?

Qwen3 4B 3 Sfted Grpo H200 1500step is a 4B-parameter model developed by quelmap. It is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Qwen3 4B 3 Sfted Grpo H200 1500step?

Qwen3 4B 3 Sfted Grpo H200 1500step is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.
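For readers who want to try the checkpoint locally, the following is a minimal sketch of loading it with the Hugging Face Transformers library. It assumes `torch`, `transformers`, and `accelerate` (needed for `device_map="auto"`) are installed; the helper name `load_model` is hypothetical, and the repository's own model card may recommend different settings.

```python
# Repository ID as listed on this page.
MODEL_ID = "quelmap/qwen3-4b-3-sfted-grpo-h200-1500step"


def load_model(model_id: str = MODEL_ID):
    """Download (if needed) and load the tokenizer and model weights.

    Imports are deferred so this file can be imported without the heavy
    dependencies installed. Assumes about 8.1 GB of free accelerator memory.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # matches the "Torch Data Type" listed on this page
        device_map="auto",           # assumption: let accelerate place the weights
    )
    return tokenizer, model
```

Calling `load_model()` triggers a download of the two weight shards on first use, so it is best run on a machine with enough disk space and bandwidth.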

Qwen3 4B 3 Sfted Grpo H200 1500step by quelmap — VRAM 8.1GB, 40K context

Name: Qwen3 4B 3 Sfted Grpo H200 1500step
Author: quelmap

Qwen3 4B 3 Sfted Grpo H200 1500step is an open-source language model by quelmap. Features: 4b LLM, VRAM: 8.1GB, Context: 40K, License: apache-2.0, LLM Explorer Score: 0.19.

Tags: Autotrain compatible, Base model:finetune:quelmap/qw..., Base model:quelmap/qwen3-4b-sf..., Conversational, En, Endpoints compatible, Qwen3, Region:us, Safetensors, Sharded, Tensorflow, Unsloth

Model Card on HF 🤗: https://huggingface.co/quelmap/qwen3-4b-3-sfted-grpo-h200-1500step

Qwen3 4B 3 Sfted Grpo H200 1500step Benchmarks

LLME Score: 0.1939

Qwen3 4B 3 Sfted Grpo H200 1500step Parameters and Internals

LLM Name	Qwen3 4B 3 Sfted Grpo H200 1500step
Repository 🤗	https://huggingface.co/quelmap/qwen3-4b-3-sfted-grpo-h200-1500step
Base Model(s)	Qwen3 4B Sft Pretrained (quelmap/qwen3-4b-sft-pretrained)
Model Size	4b
Required VRAM	8.1 GB
Updated	2025-07-14
Maintainer	quelmap
Model Type	qwen3
Model Files	5.0 GB (1-of-2), 3.1 GB (2-of-2)
Supported Languages	en
Model Architecture	Qwen3ForCausalLM
License	apache-2.0
Context Length	40960
Model Max Length	40960
Transformers Version	4.53.1
Tokenizer Class	Qwen2Tokenizer
Padding Token	<|vision_pad|>
Vocabulary Size	151936
Torch Data Type	bfloat16
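The figures in the table above can be cross-checked with simple arithmetic. The sketch below sums the two listed shard sizes and compares the result with a nominal estimate of parameters times bytes per parameter. Treating "4b" as exactly 4e9 parameters is an assumption, as is the idea that the listed Required VRAM reflects weights only; activations and the KV cache need additional memory at run time.

```python
# Shard sizes in GB, as listed under "Model Files" on this page.
SHARD_SIZES_GB = [5.0, 3.1]

# Assumption: "4b" is treated as a nominal 4e9 parameters.
NOMINAL_PARAMS = 4e9
BYTES_PER_PARAM = 2  # bfloat16 stores each parameter in 2 bytes


def total_shard_size_gb(shards=SHARD_SIZES_GB) -> float:
    """Sum the shard sizes, rounded to one decimal like the page's figures."""
    return round(sum(shards), 1)


def nominal_weight_size_gb(params: float = NOMINAL_PARAMS,
                           bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Estimate weight memory as parameters times bytes per parameter (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(total_shard_size_gb())     # sum of the two listed shards
    print(nominal_weight_size_gb())  # nominal bfloat16 estimate
```

The shard total matches the 8.1 GB listed as Required VRAM, and the nominal estimate lands close to it, which is consistent with the weights being stored in bfloat16.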

Best Alternatives to Qwen3 4B 3 Sfted Grpo H200 1500step

Best Alternatives	Context / RAM	Downloads	Likes
Fable Traces	256K / 8.1 GB	277	157
FastContext 1.0 4B SFT	256K / 8.1 GB	5735	357
Qwen3 4B Instruct 2507	256K / 8.1 GB	5376006	894
FastContext 1.0 4B RL	256K / 8.1 GB	4559	61
GRPO 4 70	256K / 8.1 GB	5	0
Qwen3 4B Thinking 2507	256K / 8.1 GB	561880	600
Lightning 4B	256K / 8.1 GB	13	6
Qwen3 4B Instruct 2507 FP8	256K / 5.2 GB	840281	79
AgentCPM Explore	256K / 8.9 GB	499	415
Qwen3 4B Thinking 2507 FP8	256K / 5.2 GB	177981	66

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20260328a