What are the hardware requirements for Qwen3 4B Thinking 2507 Hermes 3?

Qwen3 4B Thinking 2507 Hermes 3 requires approximately 8.1 GB of VRAM and supports a context window of 256K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Qwen3 4B Thinking 2507 Hermes 3 and how large is it?

Qwen3 4B Thinking 2507 Hermes 3 is developed by ertghiu256, a model with 4b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Qwen3 4B Thinking 2507 Hermes 3?

Qwen3 4B Thinking 2507 Hermes 3 is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Qwen3 4B Thinking 2507 Hermes 3 by ertghiu256 — VRAM 8.1GB, 256K context

Name: Qwen3 4B Thinking 2507 Hermes 3
Author: ertghiu256

Qwen3 4B Thinking 2507 Hermes 3 is an open-source language model by ertghiu256. Features: 4b LLM, VRAM: 8.1GB, Context: 256K, License: apache-2.0, Quantized, LLM Explorer Score: 0.21.

Base model:quantized:qwen/qwen... Base model:qwen/qwen3-4b-think... Conversational Dataset:nousresearch/hermes-3-... En Endpoints compatible Gguf Q5 Quantized Qwen3 Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3

Qwen3 4B Thinking 2507 Hermes 3 Benchmarks

LLME Score: 0.20881

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen3 4B Thinking 2507 Hermes 3 (ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3)

🌟 Advertise your project 🚀

Qwen3 4B Thinking 2507 Hermes 3 Parameters and Internals

LLM Name	Qwen3 4B Thinking 2507 Hermes 3
Repository 🤗	https://huggingface.co/ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3
Base Model(s)	Qwen3 4B Thinking 2507 Qwen/Qwen3-4B-Thinking-2507
Model Size	4b
Required VRAM	8.1 GB
Updated	2026-07-09
Maintainer	ertghiu256
Model Type	qwen3
Model Files	5.0 GB: 1-of-2 3.1 GB: 2-of-2 2.4 GB 2.9 GB 4.3 GB 8.1 GB
Supported Languages	en
GGUF Quantization	Yes
Quantization Type	gguf\|q5\|q5_k
Model Architecture	Qwen3ForCausalLM
License	apache-2.0
Context Length	262144
Model Max Length	262144
Transformers Version	4.55.4
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|vision_pad\|>
Vocabulary Size	151936
Torch Data Type	float16
Errors	replace

Best Alternatives to Qwen3 4B Thinking 2507 Hermes 3

Best Alternatives	Context / RAM	Downloads	Likes
...wen3 4B Toolcalling Gguf Codex	256K / 4.3 GB	3524	54
...B Toolcall Gguf Llamacpp Codex	256K / 4.3 GB	968	7
Qwen3 4B Tcomanr Merge V2.2	256K / 8 GB	95	2
Qwen3 4B Tcomanr Merge V2	256K / 8 GB	27	2
Qwen3 4B 128K GGUF	128K / 1.1 GB	1275	27
Qwen3 4B GGUF	40K / 1.1 GB	131929	229
MiniAI Quata1 4B	40K / 8.1 GB	27	0
Hmanlab Ai V0.1	40K / 8.1 GB	95	1
Qwen3 Hermes 4B	40K / 8.1 GB	121	3
Qwen3 4B GGUF	40K / 1.7 GB	332	6

Note: green Score (e.g. "73.2") means that the model is better than ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3.

Rank the Qwen3 4B Thinking 2507 Hermes 3 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 54964 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Qwen3 4B Thinking 2507 Hermes 3 by ertghiu256

» All LLMs » ertghiu256 » Qwen3 4B Thinking 2507 Hermes 3 URL Share it on

Qwen3 4B Thinking 2507 Hermes 3 Benchmarks

Qwen3 4B Thinking 2507 Hermes 3 Parameters and Internals

Best Alternatives to Qwen3 4B Thinking 2507 Hermes 3

Rank the Qwen3 4B Thinking 2507 Hermes 3 Capabilities

What open-source LLMs or SLMs are you in search of? 54964 in total.