Tess XS V1.3 Yarn 128K by migtissera


Tess XS V1.3 Yarn 128K is an open-source language model by migtissera. Features: LLM, VRAM: 14.5GB, Context: 32K, License: apache-2.0, HF Score: 62.5, LLM Explorer Score: 0.15, Arc: 61.6, HellaSwag: 83, MMLU: 62.1, TruthfulQA: 50.2, WinoGrande: 74.7, GSM8K: 43.4.

  Autotrain compatible   Custom code   Endpoints compatible   Mistral   Pytorch   Region:us   Yarn

Tess XS V1.3 Yarn 128K Benchmarks

Tess XS V1.3 Yarn 128K (migtissera/Tess-XS-v1-3-yarn-128K)

Tess XS V1.3 Yarn 128K Parameters and Internals

Model Type 
Large Language Model
Use Cases 
Limitations:
Slight repetition noticed around 16K context length.
Considerations:
Test the model on your specific use case and limit the context length.
Additional Notes 
This model has been tested on context length up to 16K.
Training Details 
Methodology:
General purpose language model trained on the Nous Research Mistral-7B-yarn-128K base.
Context Length:
16000
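
The card gives two different limits: a tested context of 16000 tokens and a configured context length of 32768. One cautious way to follow the advice to limit context length is to trim the prompt to the tested figure before generation. The sketch below is only an illustration: `fit_to_budget`, `reserve_for_output`, and the default of 512 reserved tokens are hypothetical names and values, not part of the model or any library.

```python
TESTED_CONTEXT = 16_000   # from the card: tested up to 16K tokens
CONFIG_CONTEXT = 32_768   # from the card: configured context length


def fit_to_budget(token_ids, reserve_for_output=512, limit=TESTED_CONTEXT):
    """Keep the most recent tokens so that prompt + output fits in `limit`.

    `reserve_for_output` (an assumed default) is the number of tokens
    set aside for the model's reply.
    """
    if not 0 <= reserve_for_output < limit:
        raise ValueError("reserve_for_output must be in [0, limit)")
    budget = limit - reserve_for_output
    # Slicing from the end drops the oldest tokens first.
    return list(token_ids[-budget:])
```

Dropping the oldest tokens is a design choice that suits chat-style use; a summarization workload might instead prefer to keep the start of the document.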
Input Output 
Input Format:
SYSTEM: USER: ASSISTANT:
Performance Tips:
Test the model on your use case and limit the context length to improve performance.
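
The input format above can be assembled with a small helper. This is a minimal sketch that assumes the three role markers are separated by newlines and that earlier turns are replayed as alternating USER/ASSISTANT lines; the listing only gives the markers themselves, so check the layout against the model repository before relying on it. `build_prompt` and its parameters are hypothetical names.

```python
def build_prompt(user_message, system_message="", history=()):
    """Build a SYSTEM / USER / ASSISTANT prompt string.

    `history` is an iterable of (user, assistant) pairs from earlier turns.
    Newline separation between roles is an assumption, not confirmed here.
    """
    lines = [f"SYSTEM: {system_message}".rstrip()]
    for past_user, past_assistant in history:
        lines.append(f"USER: {past_user}")
        lines.append(f"ASSISTANT: {past_assistant}")
    lines.append(f"USER: {user_message}")
    # End with the bare marker so the model continues as the assistant.
    lines.append("ASSISTANT:")
    return "\n".join(lines)
```

For example, `build_prompt("Summarize this report.", "You are a concise assistant.")` yields three lines, ending in `ASSISTANT:` for the model to complete.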
Release Notes 
Version:
Tess-XS-v1.3
Notes:
Stable release. Issues from versions 1.0, 1.1, and 1.2 have been rectified.
LLM Name: Tess XS V1.3 Yarn 128K
Repository: https://huggingface.co/migtissera/Tess-XS-v1-3-yarn-128K
Required VRAM: 14.5 GB
Updated: 2025-09-23
Maintainer: migtissera
Model Type: mistral
Model Files: 14.5 GB
Model Architecture: MistralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.35.1
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 32000
Torch Data Type: bfloat16
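
The 14.5 GB file size and the bfloat16 data type can be cross-checked with simple arithmetic. The sketch below assumes that "GB" means 10^9 bytes and that the files hold weights only; it does not account for the KV cache or activations, so real memory use at long context will be higher. The function names are hypothetical.

```python
# Bytes used to store one parameter in each data type.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2}


def estimate_param_count(weights_gb, dtype="bfloat16"):
    """Rough parameter count implied by a weight-file size (GB = 1e9 bytes)."""
    return weights_gb * 1e9 / BYTES_PER_PARAM[dtype]


def estimate_weights_gb(param_count, dtype):
    """Rough weight size in GB if the same parameters use another dtype."""
    return param_count * BYTES_PER_PARAM[dtype] / 1e9


params = estimate_param_count(14.5, "bfloat16")      # about 7.25 billion
fp32_size_gb = estimate_weights_gb(params, "float32")  # about 29 GB
```

Under these assumptions the listed size corresponds to roughly 7.25 billion parameters, which is in line with a Mistral-7B-class base model.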

Best Alternatives to Tess XS V1.3 Yarn 128K

Model | Context / RAM | Downloads and Likes (digits fused in source)
Krutrim 2 Instruct | 1000K / 49.3 GB | 14736
Ft V1 Violet | 1000K / 24.5 GB | 50
Mistral Large Instruct 2407 | 128K / 226.7 GB | 7491859
Tiny Random MistralForCausalLM | 128K / 0 GB | 32521
Winterreise M7 | 32K / 14.4 GB | 00
Frostwind V2.1 M7 | 32K / 14.4 GB | 00
MistralLite | 32K / 14.4 GB | 11345435
K2S3 V0.1 | 32K / 28.7 GB | 60
MistralLite | 32K / 14.4 GB | 61777430
...ydaz Web AI Reasoner BaseModel | 32K / 14.4 GB | 01


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a