Yarn Llama 2 70B 32K GGUF is an open-source language model created by NousResearch and maintained in GGUF-quantized form by TheBloke. Features: 70B parameters, required VRAM: 29.3 GB, license: apache-2.0, quantized, LLM Explorer Score: 0.11.

| Property | Value |
|---|---|
| LLM Name | Yarn Llama 2 70B 32K GGUF |
| Repository 🤗 | https://huggingface.co/TheBloke/Yarn-Llama-2-70B-32k-GGUF |
| Model Name | Yarn Llama 2 70B 32K |
| Model Creator | NousResearch |
| Base Model(s) | |
| Model Size | 70b |
| Required VRAM | 29.3 GB |
| Updated | 2026-03-29 |
| Maintainer | TheBloke |
| Model Type | llama |
| Model Files | |
| Supported Languages | en |
| GGUF Quantization | Yes |
| Context Length | 8k |
| Quantization Type | gguf |
| Model Architecture | AutoModel |
| License | apache-2.0 |
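
As a rough sanity check on the figures above, the listed 29.3 GB for a 70b model works out to a little over 3 bits per weight on average. The sketch below does that arithmetic. It assumes that 1 GB means 10^9 bytes, that the parameter count is exactly 70 billion, and that the listed size is dominated by the weights. The helper name `bits_per_weight` is hypothetical and not part of any library.

```python
def bits_per_weight(size_gb: float, n_params: float) -> float:
    """Average number of bits stored per parameter for a file of size_gb gigabytes."""
    size_bits = size_gb * 1e9 * 8  # assumption: 1 GB = 10**9 bytes
    return size_bits / n_params


# Figures from the table: 29.3 GB and 70b parameters (assumed to be exactly 70e9).
estimate = bits_per_weight(29.3, 70e9)
print(f"{estimate:.2f} bits per weight")
```

An estimate in this range is consistent with a low-bit quantization rather than 16-bit weights, which would need roughly 140 GB under the same assumptions.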

| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...us Qwen3 R1 Llama Distill GGUF | 0K / 0.8 GB | 130 | 2 |
| ...gekit Passthrough Yqhuxcv GGUF | 0K / 16.9 GB | 110 | 0 |
| KafkaLM 70B German V0.1 GGUF | 0K / 25.5 GB | 2402 | 56 |
| CodeLlama 70B Instruct GGUF | 0K / 25.5 GB | 2159 | 60 |
| CodeLlama 70B Python GGUF | 0K / 25.5 GB | 1596 | 44 |
| Meta Llama 3 70B Instruct GGUF | 0K / 26.4 GB | 133 | 4 |
| DAD Model V2 70B Q4 | 0K / 42.5 GB | 8 | 0 |
| CodeLlama 70B Hf GGUF | 0K / 25.5 GB | 558 | 42 |
| Llama 2 70B Guanaco QLoRA GGUF | 0K / 29.3 GB | 19 | 0 |
| Swallow 70B Instruct GGUF | 0K / 29.4 GB | 1550 | 9 |