Yarn Llama 2 7B 64K GGML by TheBloke

 »  All LLMs  »  TheBloke  »  Yarn Llama 2 7B 64K GGML   URL Share it on

Yarn Llama 2 7B 64K GGML is an open-source language model by TheBloke. Features: 7b LLM, VRAM: 2.9GB, License: llama2, Quantized, LLM Explorer Score: 0.09.

  Arxiv:2309.00071 Base model:finetune:nousresear... Base model:nousresearch/yarn-l...   Dataset:pg19   Ggml   Llama   Quantized   Region:us   Yarn

Yarn Llama 2 7B 64K GGML Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Yarn Llama 2 7B 64K GGML Parameters and Internals

Model Type 
llama
Use Cases 
Areas:
research
Applications:
language modeling, natural language processing
Primary Use Cases:
long context language modeling
Additional Notes 
The model has been quantized by TheBloke to different formats for performance optimization.
Training Details 
Data Sources:
PG19 dataset
Methodology:
Further pretraining for long context support
Context Length:
64000
LLM NameYarn Llama 2 7B 64K GGML
Repository 🤗https://huggingface.co/TheBloke/Yarn-Llama-2-7B-64K-GGML 
Model NameYarn Llama 2 7B 64K
Model CreatorNousResearch
Base Model(s)  Yarn Llama 2 7B 64K   NousResearch/Yarn-Llama-2-7b-64k
Model Size7b
Required VRAM2.9 GB
Updated2026-05-04
MaintainerTheBloke
Model Typellama
Model Files  2.9 GB   3.6 GB   3.3 GB   3.0 GB   3.8 GB   4.2 GB   4.1 GB   3.8 GB   4.7 GB   5.1 GB   4.8 GB   4.7 GB   5.5 GB   7.1 GB
GGML QuantizationYes
Quantization Typeggml
Model ArchitectureAutoModel
Licensellama2

Best Alternatives to Yarn Llama 2 7B 64K GGML

Best Alternatives
Context / RAM
Downloads
Likes
Llama 2 7B Chat GGML0K / 2.9 GB3288873
Llama 2 GGML Medical Chatbot0K /  GB1445
Llama 2 7B GGML0K / 2.9 GB778219
Yarn Llama 2 7B 128K GGML0K / 2.9 GB96
CodeLlama 7B GGML0K / 3 GB1627
CodeLlama 7B Python GGML0K / 2.9 GB1424
CodeLlama 7B Instruct GGML0K / 3 GB1420
Airoboros L2 7B 2.1 GGML0K / 2.9 GB21
Zarafusionex 1.1 L2 7B GGML0K / 2.9 GB82
EDGE 0 7B GGML0K / 2.9 GB31
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Yarn-Llama-2-7B-64K-GGML.

Rank the Yarn Llama 2 7B 64K GGML Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53472 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a