Yuren 13B Chatml by pleisto

 »  All LLMs  »  pleisto  »  Yuren 13B Chatml   URL Share it on

Yuren 13B Chatml is an open-source language model by pleisto. Features: 13b LLM, VRAM: 26.1GB, Context: 4K, License: llama2, LLM Explorer Score: 0.13, Arc: 53.1, HellaSwag: 78, MMLU: 56.3, GSM8K: 28.1.

  Arxiv:2009.03300 Dataset:b-mc2/sql-create-conte...   Dataset:baai/coig   Dataset:bigcode/the-stack   Dataset:gsm8k   Dataset:mc4   Dataset:niv0   Dataset:openassistant/oasst1 Dataset:pleisto/wikipedia-cn-2...   Dataset:wenhu/theoremqa   Dataset:zjunlp/knowlm-ie   En   Endpoints compatible   Llama   Llama2   Model-index   Pytorch   Region:us   Safetensors   Sharded   Tensorflow   Zh

Yuren 13B Chatml Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Yuren 13B Chatml Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
Information synthesis, Intelligent agents, Natural language understanding, SQL generation
Applications:
Data synthesis, Information extraction, SQL generation, Structured data output
Primary Use Cases:
Enterprise internal data processing, Building intelligent agents for business scenarios
Limitations:
Model is not intended for direct public services, Unforeseen issues may arise due to model complexity
Considerations:
Strongly recommend using within controlled environments with additional security measures.
Additional Notes 
Primarily designed for enterprise internal use rather than public environments.
Supported Languages 
zh (Advanced), en (Advanced)
Training Details 
Data Sources:
bigcode/the-stack, mc4, pleisto/wikipedia-cn-20230720-filtered, gsm8k, OpenAssistant/oasst1, b-mc2/sql-create-context, niv0, BAAI/COIG, wenhu/TheoremQA, zjunlp/KnowLM-IE
Methodology:
Continuously trained based on Llama 2 with a focus on information synthesis and data-centric approaches.
Context Length:
4096
Training Time:
unknown
Model Architecture:
Extended Llama architecture with 13 billion parameters.
Responsible Ai Considerations 
Mitigation Strategies:
Using additional security measures such as input/output filtering, reviewing, or restricting is advised.
Input Output 
Input Format:
Prompts primarily in structured data or natural language queries for text generation models.
Accepted Modalities:
text
Output Format:
SQL statements, structured data responses, standard text completion formats.
Performance Tips:
Use in controlled environments with pre-validated input formats for optimal performance.
Release Notes 
Version:
1.0
Date:
unknown
Notes:
Initial release with training and performance optimizations.
LLM NameYuren 13B Chatml
Repository 🤗https://huggingface.co/pleisto/yuren-13b-chatml 
Model Size13b
Required VRAM26.1 GB
Updated2026-04-23
Maintainerpleisto
Model Typellama
Model Files  10.0 GB: 1-of-3   9.9 GB: 2-of-3   6.2 GB: 3-of-3   10.0 GB: 1-of-3   9.9 GB: 2-of-3   6.2 GB: 3-of-3
Supported Languageszh en
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length4096
Model Max Length4096
Transformers Version4.31.0
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size36864
Torch Data Typebfloat16

Best Alternatives to Yuren 13B Chatml

Best Alternatives
Context / RAM
Downloads
Likes
Yarn Llama 2 13B 128K128K / 26 GB1657113
Luminaura RP 13B128K / 26 GB151
Agent Llama2 13B 80K80K / 26.4 GB50
Chat Llama2 13B 80K80K / 52.8 GB110
LongAlign 13B 64K Base64K / 26 GB2843
LongAlign 13B 64K64K / 26 GB5313
LongAlign 13B 64K64K / 26 GB1113
LongAlign 13B 64K Base64K / 26 GB63
Openbuddy Llama2 13B V15p1 64K64K / 26.1 GB74
Yarn Llama 2 13B 64K64K / 26 GB186518

Rank the Yuren 13B Chatml Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 54290 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a