OrcaMaid V2 FIX 13B 32K GPTQ by TheBloke

 »  All LLMs  »  TheBloke  »  OrcaMaid V2 FIX 13B 32K GPTQ   URL Share it on

OrcaMaid V2 FIX 13B 32K GPTQ is an open-source language model by TheBloke. Features: 13b LLM, VRAM: 7.3GB, Context: 32K, License: other, Quantized, LLM Explorer Score: 0.11.

  4-bit Base model:ddh0/orcamaid-v2-fi... Base model:quantized:ddh0/orca...   Custom code   Gptq   Llama   Quantized   Region:us   Safetensors

OrcaMaid V2 FIX 13B 32K GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

OrcaMaid V2 FIX 13B 32K GPTQ Parameters and Internals

Model Type 
llama
Use Cases 
Considerations:
The prompt format is Alpaca and should be modified for specific needs.
Additional Notes 
This model is a gradient SLERP merge of Microsoft's Orca-2-13b and Undi and IkariDev's Noromaid-v0.1.1-13b, biased towards Orca. Extended context length to 32768 via YaRN.
Input Output 
Input Format:
Below is an instruction that describes a task. Write a response that appropriately completes the request. ### Instruction: {prompt} ### Response:
Output Format:
Generated text based on instruction.
LLM NameOrcaMaid V2 FIX 13B 32K GPTQ
Repository 🤗https://huggingface.co/TheBloke/OrcaMaid-v2-FIX-13B-32k-GPTQ 
Model NameOrcamaid V2 Fix 13B 32K
Model Creatorddh0
Base Model(s)  OrcaMaid V2 FIX 13B 32K   ddh0/OrcaMaid-v2-FIX-13b-32k
Model Size13b
Required VRAM7.3 GB
Updated2026-04-21
MaintainerTheBloke
Model Typellama
Model Files  7.3 GB
GPTQ QuantizationYes
Quantization Typegptq
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length32768
Model Max Length32768
Transformers Version4.35.2
Tokenizer ClassLlamaTokenizer
Vocabulary Size32000
Torch Data Typefloat16

Best Alternatives to OrcaMaid V2 FIX 13B 32K GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
Yarn Llama 2 13B 128K GPTQ128K / 7.3 GB716
LongAlign 13B 64K GPTQ64K / 7.3 GB51
...boros L2 13B 2 1 YaRN 64K GPTQ64K / 7.3 GB133
Yarn Llama 2 13B 64K GPTQ64K / 7.3 GB201
OrcaMaid V3 13B 32K GPTQ32K / 7.3 GB163
EverythingLM 13B 16K GPTQ16K / 7.3 GB2814
LlongOrca 13B 16K GPT16K / 7.3 GB110
Tinybra 13B GPTQ 32g 4BIT16K / 8 GB31
Tinybra 13B GPTQ 4BIT16K / 7 GB50
WhiteRabbitNeo 13B GPTQ16K / 7.3 GB824
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/OrcaMaid-v2-FIX-13B-32k-GPTQ.

Rank the OrcaMaid V2 FIX 13B 32K GPTQ Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53205 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a