Speechless Llama2 13B GGML by TheBloke

 »  All LLMs  »  TheBloke  »  Speechless Llama2 13B GGML   URL Share it on

Speechless Llama2 13B GGML is an open-source language model by TheBloke. Features: 13b LLM, VRAM: 5.5GB, License: llama2, Quantized, Instruction-Based, LLM Explorer Score: 0.09.

  Arxiv:2307.09288 Base model:finetune:uukuguy/sp... Base model:uukuguy/speechless-... Dataset:garage-baind/open-plat...   Dataset:open-orca/openorca Dataset:wizardlm/wizardlm evol...   En   Facebook   Ggml   Instruct   Llama   Llama2   Meta   Pytorch   Quantized   Region:us

Speechless Llama2 13B GGML Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Speechless Llama2 13B GGML Parameters and Internals

Model Type 
llama
Use Cases 
Areas:
Research, Commercial applications
Applications:
Text generation, Dialogue systems
Primary Use Cases:
Chatbot development, Natural language generation tasks
Limitations:
Tested primarily in English, may not cover all scenarios
Considerations:
Developers should perform safety testing tailored to their specific applications.
Additional Notes 
Llama 2 was trained between January 2023 and July 2023.
Training Details 
Data Sources:
Open-Orca/OpenOrca, garage-bAInd/Open-Platypus, WizardLM/WizardLM_evol_instruct_V2_196k
Data Volume:
2 trillion tokens
Methodology:
Merge of Open-Orca/OpenOrca-Platypus2-13B and WizardLM/WizardLM-13B-V1.2
Context Length:
4096
Hardware Used:
A100-80GB GPUs
Model Architecture:
Optimized transformer architecture with fine-tuning using supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)
Input Output 
Input Format:
Text
Accepted Modalities:
text
Output Format:
Text
Performance Tips:
Use GGUF files for compatibility with latest llama.cpp for optimal performance.
Release Notes 
Version:
v1.1
Notes:
A merge of Open-Orca/OpenOrca-Platypus2-13B and WizardLM/WizardLM-13B-V1.2.
LLM NameSpeechless Llama2 13B GGML
Repository 🤗https://huggingface.co/TheBloke/Speechless-Llama2-13B-GGML 
Model NameSpeechless Llama2 13B
Model CreatorJiangwen Su
Base Model(s)  Speechless Llama2 13B   uukuguy/speechless-llama2-13b
Model Size13b
Required VRAM5.5 GB
Updated2026-04-23
MaintainerTheBloke
Model Typellama
Instruction-BasedYes
Model Files  5.5 GB   6.9 GB   6.3 GB   5.7 GB   7.4 GB   8.2 GB   7.9 GB   7.4 GB   9.0 GB   9.8 GB   9.2 GB   9.0 GB   10.7 GB   13.8 GB
Supported Languagesen
GGML QuantizationYes
Quantization Typeggml
Model ArchitectureAutoModel
Licensellama2

Best Alternatives to Speechless Llama2 13B GGML

Best Alternatives
Context / RAM
Downloads
Likes
RuGPT 3.5 13B GGML0K / 7.4 GB108
CodeLlama 13B Instruct GGML0K / 5.7 GB2219
...lama2 13B Orca V2 8K 3166 GGML0K / 5.7 GB214
Vigogne 2 13B Instruct GGML0K / 5.5 GB66
...1.0 Uncensored Llama2 13B GGML0K / 5.5 GB960
...aMa 3 Instruct Zeroed 13B GGUF0K / 5 GB291
Llama 3 13B Instruct V0.1 GGUF0K / 5.1 GB5905
Codellama 7B Instruct GGUF0K / 2.8 GB2011
Codellama 13B Instruct GGUF0K / 13.8 GB370
Law LLM 13B GGUF0K / 5.4 GB119210
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Speechless-Llama2-13B-GGML.

Rank the Speechless Llama2 13B GGML Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53205 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a