Hermes 4 14B 4bit by mlx-community

Hermes 4 14B 4bit is an open-source language model published by mlx-community, a 4-bit quantized version of NousResearch/Hermes-4-14B. Key facts: 14B parameters, 8.3 GB required VRAM, 40K context, apache-2.0 license, LLM Explorer Score 0.23.

Tags: 4-bit, 4bit, Atropos, Base model:nousresearch/hermes..., Base model:quantized:nousresea..., Chat, Chatml, Conversational, Dataforge, En, Finetuned, Function calling, Hybrid-mode, Instruct, Json mode, Long context, Mlx, Quantized, Qwen-3-14b, Qwen3, Reasoning, Region:us, Roleplaying, Safetensors, Sharded, Structured outputs, Tensorflow, Tool use

Hermes 4 14B 4bit Parameters and Internals

LLM Name: Hermes 4 14B 4bit
Repository: https://huggingface.co/mlx-community/Hermes-4-14B-4bit
Base Model(s): Hermes 4 14B (NousResearch/Hermes-4-14B)
Model Size: 14b
Required VRAM: 8.3 GB
Updated: 2026-05-24
Maintainer: mlx-community
Model Type: qwen3
Model Files: 5.3 GB (1-of-2), 3.0 GB (2-of-2)
Supported Languages: en
Quantization Type: 4bit
Model Architecture: Qwen3ForCausalLM
License: apache-2.0
Context Length: 40960
Model Max Length: 40960
Transformers Version: 4.54.0
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 151936
Torch Data Type: bfloat16
Errors: replace
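
The figures in this table can be cross-checked with simple arithmetic. The sketch below sums the two listed shard sizes and compares the total with the Required VRAM entry, then converts the context length into the "40K" shorthand used elsewhere on the page. The constant and function names are hypothetical and chosen only for illustration; the values are copied from the table. That the totals match suggests the VRAM figure reflects weight size alone, without KV cache or activation memory, but that reading is an assumption rather than something the page states.

```python
# Values copied from the parameter table; all names here are illustrative.
SHARD_SIZES_GB = [5.3, 3.0]   # Model Files: 1-of-2 and 2-of-2
REQUIRED_VRAM_GB = 8.3        # Required VRAM as listed
CONTEXT_LENGTH = 40960        # Context Length in tokens


def total_weights_gb(shards):
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(shards), 1)


def context_in_k(tokens):
    """Express a token count in units of 1024 tokens."""
    return tokens // 1024


if __name__ == "__main__":
    print(f"Total shard size: {total_weights_gb(SHARD_SIZES_GB)} GB")
    print(f"Context length: {context_in_k(CONTEXT_LENGTH)}K tokens")
```

When planning real memory use, it is safer to budget headroom beyond the listed figure, since long prompts near the 40960-token limit will need additional memory at inference time.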

Best Alternatives to Hermes 4 14B 4bit

Best Alternatives                   Context / RAM    Downloads, Likes
Prototie Ai                         40K / 29.5 GB    3460
Qwen3 14B Unsloth Bnb 4bit          40K / 11.2 GB    20074315
Qwen3 14B MLX 4bit                  40K / 8.3 GB     476785
Qwen3 14B Bnb 4bit                  40K / 9.9 GB     102314
Qwen3 14B MLX 4bit                  40K / 7.9 GB     214013
...wen3 14B Non Thinking V6 16bit   40K / 29.5 GB    180
Merged16 Sft Qwen3                  40K / 29.5 GB    50
Bee1reason Arabic Qwen 14B          40K / 29.5 GB    5848
Qwen3 14B 4bit DWQ 053125           40K / 8.3 GB     4326
Qwen3 14B MLX 8bit                  40K / 15.2 GB    5185
Note: green Score (e.g. "73.2") means that the model is better than mlx-community/Hermes-4-14B-4bit.

Original data from HuggingFace, OpenCompass and various public git repos.