What are the hardware requirements for Qwen 3 4B Mixture Of Thought?

Qwen 3 4B Mixture Of Thought requires approximately 8.1 GB of VRAM and supports a context window of 40K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Qwen 3 4B Mixture Of Thought and how large is it?

Qwen 3 4B Mixture Of Thought is developed by ertghiu256, a model with 4b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Qwen 3 4B Mixture Of Thought?

Qwen 3 4B Mixture Of Thought is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Qwen 3 4B Mixture Of Thought by ertghiu256 — VRAM 8.1GB, 40K context

Name: Qwen 3 4B Mixture Of Thought
Author: ertghiu256

Qwen 3 4B Mixture Of Thought is an open-source language model by ertghiu256. Features: 4b LLM, VRAM: 8.1GB, Context: 40K, License: apache-2.0, LLM Explorer Score: 0.19.

Base model:finetune:qwen/qwen3... Base model:qwen/qwen3-4b Cot Dataset:open-r1/mixture-of-tho... Dataset:psm24/gemini-2.5-pro-1... Pytorch Qwen3 Reasoning Region:us Sft Sharded Think Trl Unsloth

Model Card on HF 🤗: https://huggingface.co/ertghiu256/qwen-3-4b-mixture-of-thought

Qwen 3 4B Mixture Of Thought Benchmarks

LLME Score: 0.18634

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen 3 4B Mixture Of Thought (ertghiu256/qwen-3-4b-mixture-of-thought)

🌟 Advertise your project 🚀

Qwen 3 4B Mixture Of Thought Parameters and Internals

LLM Name	Qwen 3 4B Mixture Of Thought
Repository 🤗	https://huggingface.co/ertghiu256/qwen-3-4b-mixture-of-thought
Base Model(s)	Qwen/Qwen3-4B Qwen/Qwen3-4B
Model Size	4b
Required VRAM	8.1 GB
Updated	2026-06-18
Maintainer	ertghiu256
Model Type	qwen3
Model Files	5.0 GB: 1-of-2 3.1 GB: 2-of-2
Model Architecture	Qwen3ForCausalLM
License	apache-2.0
Context Length	40960
Model Max Length	40960
Transformers Version	4.51.3
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|vision_pad\|>
Vocabulary Size	151936
Torch Data Type	float16
Errors	replace

Best Alternatives to Qwen 3 4B Mixture Of Thought

Best Alternatives	Context / RAM	Downloads	Likes
Fable Traces	256K / 8.1 GB	5292	207
FastContext 1.0 4B SFT	256K / 8.1 GB	5735	357
Qwen3 4B Instruct 2507	256K / 8.1 GB	4522780	898
GRPO 4 70	256K / 8.1 GB	5	0
FastContext 1.0 4B RL	256K / 8.1 GB	4559	61
Qwen3 4B Thinking 2507	256K / 8.1 GB	529167	601
Lightning 4B	256K / 8.1 GB	13	6
Qwen3 4B Instruct 2507 FP8	256K / 5.2 GB	994800	78
Nexa AI 4B Instruct	256K / 8 GB	436	2
Tac 1	256K / 8.1 GB	635	0

Note: green Score (e.g. "73.2") means that the model is better than ertghiu256/qwen-3-4b-mixture-of-thought.

Rank the Qwen 3 4B Mixture Of Thought Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 55205 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Qwen 3 4B Mixture Of Thought by ertghiu256

» All LLMs » ertghiu256 » Qwen 3 4B Mixture Of Thought URL Share it on

Qwen 3 4B Mixture Of Thought Benchmarks

Qwen 3 4B Mixture Of Thought Parameters and Internals

Best Alternatives to Qwen 3 4B Mixture Of Thought

Rank the Qwen 3 4B Mixture Of Thought Capabilities

What open-source LLMs or SLMs are you in search of? 55205 in total.