What are the hardware requirements for Qwen3 4B 2 Sfted Grpo H200 1200step?

Qwen3 4B 2 Sfted Grpo H200 1200step requires approximately 8.1 GB of VRAM and supports a context window of 40K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Qwen3 4B 2 Sfted Grpo H200 1200step and how large is it?

Qwen3 4B 2 Sfted Grpo H200 1200step is developed by quelmap, a model with 4b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Qwen3 4B 2 Sfted Grpo H200 1200step?

Qwen3 4B 2 Sfted Grpo H200 1200step is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Qwen3 4B 2 Sfted Grpo H200 1200step by quelmap — VRAM 8.1GB, 40K context

Name: Qwen3 4B 2 Sfted Grpo H200 1200step
Author: quelmap

Qwen3 4B 2 Sfted Grpo H200 1200step is an open-source language model by quelmap. Features: 4b LLM, VRAM: 8.1GB, Context: 40K, License: apache-2.0, LLM Explorer Score: 0.19.

Autotrain compatible Base model:finetune:fireworks1... Base model:fireworks1231/qwen3... Conversational En Endpoints compatible Qwen3 Region:us Safetensors Sharded Tensorflow Unsloth

Model Card on HF 🤗: https://huggingface.co/fireworks1231/qwen3-4b-2-sfted-grpo-h200-1200step

Qwen3 4B 2 Sfted Grpo H200 1200step Benchmarks

LLME Score: 0.19312

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen3 4B 2 Sfted Grpo H200 1200step (fireworks1231/qwen3-4b-2-sfted-grpo-h200-1200step)

🌟 Advertise your project 🚀

Qwen3 4B 2 Sfted Grpo H200 1200step Parameters and Internals

LLM Name	Qwen3 4B 2 Sfted Grpo H200 1200step
Repository 🤗	https://huggingface.co/fireworks1231/qwen3-4b-2-sfted-grpo-h200-1200step
Base Model(s)	Qwen3 4B Sft Pretrained quelmap/qwen3-4b-sft-pretrained
Model Size	4b
Required VRAM	8.1 GB
Updated	2025-11-12
Maintainer	quelmap
Model Type	qwen3
Model Files	5.0 GB: 1-of-2 3.1 GB: 2-of-2
Supported Languages	en
Model Architecture	Qwen3ForCausalLM
License	apache-2.0
Context Length	40960
Model Max Length	40960
Transformers Version	4.53.0
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|vision_pad\|>
Vocabulary Size	151936
Torch Data Type	bfloat16

Best Alternatives to Qwen3 4B 2 Sfted Grpo H200 1200step

Best Alternatives	Context / RAM	Downloads	Likes
FastContext 1.0 4B SFT	256K / 8.1 GB	5735	355
FastContext 1.0 4B RL	256K / 8.1 GB	3533	59
Qwen3 4B Instruct 2507	256K / 8.1 GB	4833125	869
GRPO 4 70	256K / 8.1 GB	5	0
Qwen3 4B Thinking 2507	256K / 8.1 GB	599357	591
Lightning 4B	256K / 8.1 GB	13	6
Qwen3 4B Instruct 2507 FP8	256K / 5.2 GB	840281	79
Jan V1 4B	256K / 8.1 GB	108354	353
AgentCPM Explore	256K / 8.9 GB	499	415
CyberSecQwen 4B	256K / 8 GB	1216	13

Note: green Score (e.g. "73.2") means that the model is better than fireworks1231/qwen3-4b-2-sfted-grpo-h200-1200step.

Rank the Qwen3 4B 2 Sfted Grpo H200 1200step Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 54565 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Qwen3 4B 2 Sfted Grpo H200 1200step by quelmap

» All LLMs » quelmap » Qwen3 4B 2 Sfted Grpo H200 1200step URL Share it on

Qwen3 4B 2 Sfted Grpo H200 1200step Benchmarks

Qwen3 4B 2 Sfted Grpo H200 1200step Parameters and Internals

Best Alternatives to Qwen3 4B 2 Sfted Grpo H200 1200step

Rank the Qwen3 4B 2 Sfted Grpo H200 1200step Capabilities

What open-source LLMs or SLMs are you in search of? 54565 in total.