Stheno L2 13B 8bit EXL2 by QMB15

  8bit   Autotrain compatible   En   Endpoints compatible   Exl2   Llama   Pytorch   Quantized   Region:us

Stheno L2 13B 8bit EXL2 Parameters and Internals

Model Type 
text generation
Additional Notes 
Quants courtesy of TheBloke. Includes measurement.json for convenience of quantizing to other sizes.
Supported Languages 
en
Training Details 
Data Sources:
https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet
Methodology:
Gradient Merge of Stheno-P1 & Stheno-P2
Model Architecture:
Experimental merge of several models using the Ties-Merge and BlockMerge_Gradient methods
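To illustrate what a "gradient" merge of two models means in general, here is a minimal sketch. It is not the author's merge script: the function names, the toy list-of-lists weight layout, and the linear ratio schedule are all assumptions made for illustration. The idea shown is that each layer is a weighted blend of the two parents, with the blend ratio changing gradually from the first layer to the last.

```python
from typing import List

Layer = List[float]  # hypothetical stand-in for one layer's weight tensor


def gradient_ratios(num_layers: int, start: float, end: float) -> List[float]:
    """Return one blend ratio per layer, moving linearly from start to end."""
    if num_layers < 1:
        raise ValueError("num_layers must be at least 1")
    if num_layers == 1:
        return [start]
    step = (end - start) / (num_layers - 1)
    return [start + step * i for i in range(num_layers)]


def gradient_merge(model_a: List[Layer], model_b: List[Layer],
                   start: float = 0.0, end: float = 1.0) -> List[Layer]:
    """Blend two models layer by layer: (1 - r) * a + r * b."""
    if len(model_a) != len(model_b):
        raise ValueError("both models need the same number of layers")
    ratios = gradient_ratios(len(model_a), start, end)
    merged = []
    for layer_a, layer_b, r in zip(model_a, model_b, ratios):
        merged.append([(1 - r) * a + r * b for a, b in zip(layer_a, layer_b)])
    return merged


# Toy example: three layers, each with two weights.
part_1 = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]
part_2 = [[3.0, 3.0], [3.0, 3.0], [3.0, 3.0]]
merged = gradient_merge(part_1, part_2)
```

In this toy run the first layer comes entirely from the first parent, the last layer entirely from the second, and the middle layer is an even mix. Real merge tools operate on named tensors and may use different schedules.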
Input Output 
Performance Tips:
The model has been tested with the Alpaca prompt format and works well.
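As a rough sketch of what an Alpaca-style prompt looks like, the helper below builds one from an instruction string. The listing only names the format; the exact wording used here is the common no-input Alpaca template, and the assumption is that this is the variant meant. The function name `build_alpaca_prompt` is hypothetical.

```python
# Common no-input Alpaca template (assumed to be the variant the listing means).
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_alpaca_prompt(instruction: str) -> str:
    """Wrap a user instruction in the Alpaca prompt layout."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


prompt = build_alpaca_prompt("Summarize the plot of Hamlet in two sentences.")
print(prompt)
```

The model's completion is expected to follow the final "### Response:" line, so generation is usually stopped at the end-of-sentence token or at the next "### Instruction:" marker.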
LLM Name: Stheno L2 13B 8bit EXL2
Repository: https://huggingface.co/QMB15/Stheno-L2-13B-8bit-exl2
Model Size: 13b
Required VRAM: 10.9 GB
Updated: 2025-08-18
Maintainer: QMB15
Model Type: llama
Model Files: 10.9 GB
Supported Languages: en
Quantization Type: exl2|8bit
Model Architecture: LlamaForCausalLM
License: llama2
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.32.1
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: float16
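The VRAM figure above appears to match the size of the model files, so memory for the attention cache at full context likely comes on top of it. The sketch below gives a back-of-the-envelope estimate. The context length (4096) and float16 data type come from the listing; the layer count (40) and hidden size (5120) are not stated here and are assumed from the usual Llama 2 13B configuration. The function name is hypothetical.

```python
def kv_cache_bytes(context_length: int, num_layers: int, hidden_size: int,
                   bytes_per_value: int = 2) -> int:
    """Estimate key/value cache size for one sequence.

    Each layer stores one key and one value vector of width hidden_size
    per token, hence the leading factor of 2.
    """
    return 2 * num_layers * hidden_size * bytes_per_value * context_length


# Assumed architecture values (standard Llama 2 13B, not given in the listing).
NUM_LAYERS = 40
HIDDEN_SIZE = 5120

cache = kv_cache_bytes(context_length=4096, num_layers=NUM_LAYERS,
                       hidden_size=HIDDEN_SIZE, bytes_per_value=2)
print(f"{cache / 2**30:.3f} GiB")  # prints "3.125 GiB"
```

Actual usage depends on the loader, batch size, and any cache compression it applies, so this should be read as an upper-level estimate for a single full-length sequence rather than a measured value.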

Best Alternatives to Stheno L2 13B 8bit EXL2

Best Alternatives | Context / RAM | Downloads and Likes (digits fused in source)
Llama13b 32K Illumeet Finetune | 32K / 26 GB | 50
...Maid V3 13B 32K 6.0bpw H6 EXL2 | 32K / 10 GB | 51
...Maid V3 13B 32K 8.0bpw H8 EXL2 | 32K / 13.2 GB | 51
WhiteRabbitNeo 13B V1 | 16K / 26 GB | 2630425
CodeLlama 13B Python Fp16 | 16K / 26 GB | 309425
CodeLlama 13B Instruct Fp16 | 16K / 26 GB | 309828
...Llama 13B Instruct Hf 4bit MLX | 16K / 7.8 GB | 11702
CodeLlama 13B Fp16 | 16K / 26 GB | 1166
Codellama 13B Bnb 4bit | 16K / 7.2 GB | 965
Airophin 13B Pntk 16K Fp16 | 16K / 26 GB | 15164

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124