Nous Hermes 13B GPTQ by TheBloke


Nous Hermes 13B GPTQ is an open-source language model by TheBloke. Features: 13b LLM, VRAM: 7.5GB, Context: 2K, License: other, Quantized, HF Score: 54, LLM Explorer Score: 0.11, Arc: 56.6, HellaSwag: 82.1, MMLU: 50.4, TruthfulQA: 51.5, WinoGrande: 75.3, GSM8K: 8.3.
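The HF Score of 54 appears consistent with an unweighted average of the six benchmark figures listed above. A minimal sketch, assuming the score is a plain mean (the page does not state how it is computed):

```python
# Benchmark figures copied from the summary above.
scores = {
    "ARC": 56.6,
    "HellaSwag": 82.1,
    "MMLU": 50.4,
    "TruthfulQA": 51.5,
    "WinoGrande": 75.3,
    "GSM8K": 8.3,
}

# Assumption: the aggregate is an unweighted mean of the six benchmarks.
mean_score = sum(scores.values()) / len(scores)
print(f"{mean_score:.1f}")
```

Under that assumption the mean rounds to the listed score, with the low GSM8K result pulling the average down noticeably.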

  4-bit   Deploy:azure   Distillation   En   Gptq   Llama   Quantized   Region:us   Safetensors   Self-instruct

Nous Hermes 13B GPTQ Benchmarks

Nous Hermes 13B GPTQ (TheBloke/Nous-Hermes-13B-GPTQ)

Nous Hermes 13B GPTQ Parameters and Internals

Model Type 
language model, text generation
Use Cases 
Areas:
research, commercial applications
Primary Use Cases:
long response generation, low hallucination generation
Limitations:
not specified
Additional Notes 
Benchmarks are pending. Compute provided by Redmond AI.
Supported Languages 
en (high)
Training Details 
Data Sources:
GPTeacher, general roleplay v1&2, code instruct datasets, Nous Instruct & PDACTL, CodeAlpaca, Evol_Instruct Uncensored, GPT4-LLM, Unnatural Instructions, Camel-AI's Biology/Physics/Chemistry and Math Datasets, Airoboros' GPT-4 Dataset
Data Volume:
300,000 instructions
Methodology:
Fine-tuned on synthetic GPT-4 outputs; sequence length of 2000.
Context Length:
2000
Training Time:
50+ hours on an 8x A100 80GB DGX machine
Hardware Used:
8x A100 80GB DGX machine
Model Architecture:
Llama 13B model, enhanced through fine-tuning.
Input Output 
Input Format:
Alpaca prompt format
Accepted Modalities:
text
Output Format:
Textual responses
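The Alpaca prompt format named above wraps the user's request in instruction and response markers. A minimal sketch, assuming the common no-input Alpaca-style template; the exact wording this model expects is not given here and should be checked against the repository's model card. `ALPACA_TEMPLATE` and `build_prompt` are hypothetical names used for illustration:

```python
# Assumed Alpaca-style template (no-input variant); the exact wording the
# model was trained with is not stated on this page.
ALPACA_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n"


def build_prompt(instruction: str) -> str:
    """Wrap a user instruction in the assumed Alpaca-style template."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


prompt = build_prompt("Summarize what GPTQ quantization does.")
print(prompt)
```

The model's completion would then be read from whatever text it generates after the final response marker.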
Release Notes 
Version:
GPTQ 4bit
Notes:
Quantisation to 4bit using GPTQ-for-LLaMa.
Version:
FP16
Notes:
Model uploaded in FP16 format.
LLM Name: Nous Hermes 13B GPTQ
Repository: https://huggingface.co/TheBloke/Nous-Hermes-13B-GPTQ
Base Model(s): Nous Hermes 13B (NousResearch/Nous-Hermes-13b)
Model Size: 13b
Required VRAM: 7.5 GB
Updated: 2026-01-02
Maintainer: TheBloke
Model Type: llama
Model Files: 7.5 GB
Supported Languages: en
GPTQ Quantization: Yes
Quantization Type: gptq
Model Architecture: LlamaForCausalLM
License: other
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.29.2
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32001
Torch Data Type: bfloat16
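The 7.5 GB file size is roughly what 4-bit weights for a 13B-parameter model would occupy, plus some overhead. A rough sketch of that arithmetic, assuming a nominal 13e9 parameters and ignoring quantization metadata (scales, zero points), any layers kept at higher precision, and runtime memory such as activations and the KV cache. `weight_size_gb` is a hypothetical helper:

```python
def weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: nominal "13B"; the exact parameter count is not listed here.
N_PARAMS = 13e9

fp16_gb = weight_size_gb(N_PARAMS, 16)
int4_gb = weight_size_gb(N_PARAMS, 4)
print(f"FP16: {fp16_gb:.1f} GB, 4-bit: {int4_gb:.1f} GB")
```

The gap between this 4-bit estimate and the listed 7.5 GB is plausibly accounted for by the items the sketch ignores, though the page does not break the figure down.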

Best Alternatives to Nous Hermes 13B GPTQ

Best Alternatives   Context / RAM   Downloads, Likes
Yarn Llama 2 13B 128K GPTQ   128K / 7.3 GB   1716
LongAlign 13B 64K GPTQ   64K / 7.3 GB   51
...boros L2 13B 2 1 YaRN 64K GPTQ   64K / 7.3 GB   123
Yarn Llama 2 13B 64K GPTQ   64K / 7.3 GB   31
OrcaMaid V3 13B 32K GPTQ   32K / 7.3 GB   123
OrcaMaid V2 FIX 13B 32K GPTQ   32K / 7.3 GB   64
EverythingLM 13B 16K GPTQ   16K / 7.3 GB   2614
LlongOrca 13B 16K GPT   16K / 7.3 GB   70
Tinybra 13B GPTQ 32g 4BIT   16K / 8 GB   11
Tinybra 13B GPTQ 4BIT   16K / 7 GB   50


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a