Yuren 13B Chatml by pleisto

 ยป  All LLMs  ยป  pleisto  ยป  Yuren 13B Chatml   URL Share it on

  Arxiv:2009.03300   Autotrain compatible Dataset:b-mc2/sql-create-conte...   Dataset:baai/coig   Dataset:bigcode/the-stack   Dataset:gsm8k   Dataset:mc4   Dataset:niv0   Dataset:openassistant/oasst1 Dataset:pleisto/wikipedia-cn-2...   Dataset:wenhu/theoremqa   Dataset:zjunlp/knowlm-ie   En   Endpoints compatible   Llama   Llama2   Model-index   Pytorch   Region:us   Safetensors   Sharded   Zh

Yuren 13B Chatml Benchmarks

Yuren 13B Chatml (pleisto/yuren-13b-chatml)
๐ŸŒŸ Advertise your project ๐Ÿš€

Yuren 13B Chatml Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
Information synthesis, Intelligent agents, Natural language understanding, SQL generation
Applications:
Data synthesis, Information extraction, SQL generation, Structured data output
Primary Use Cases:
Enterprise internal data processing, Building intelligent agents for business scenarios
Limitations:
Model is not intended for direct public services, Unforeseen issues may arise due to model complexity
Considerations:
Strongly recommend using within controlled environments with additional security measures.
Additional Notes 
Primarily designed for enterprise internal use rather than public environments.
Supported Languages 
zh (Advanced), en (Advanced)
Training Details 
Data Sources:
bigcode/the-stack, mc4, pleisto/wikipedia-cn-20230720-filtered, gsm8k, OpenAssistant/oasst1, b-mc2/sql-create-context, niv0, BAAI/COIG, wenhu/TheoremQA, zjunlp/KnowLM-IE
Methodology:
Continuously trained based on Llama 2 with a focus on information synthesis and data-centric approaches.
Context Length:
4096
Training Time:
unknown
Model Architecture:
Extended Llama architecture with 13 billion parameters.
Responsible Ai Considerations 
Mitigation Strategies:
Using additional security measures such as input/output filtering, reviewing, or restricting is advised.
Input Output 
Input Format:
Prompts primarily in structured data or natural language queries for text generation models.
Accepted Modalities:
text
Output Format:
SQL statements, structured data responses, standard text completion formats.
Performance Tips:
Use in controlled environments with pre-validated input formats for optimal performance.
Release Notes 
Version:
1.0
Date:
unknown
Notes:
Initial release with training and performance optimizations.
LLM NameYuren 13B Chatml
Repository ๐Ÿค—https://huggingface.co/pleisto/yuren-13b-chatml 
Model Size13b
Required VRAM26.1 GB
Updated2025-06-17
Maintainerpleisto
Model Typellama
Model Files  10.0 GB: 1-of-3   9.9 GB: 2-of-3   6.2 GB: 3-of-3
Supported Languageszh en
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length4096
Model Max Length4096
Transformers Version4.31.0
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size36864
Torch Data Typebfloat16

Best Alternatives to Yuren 13B Chatml

Best Alternatives
Context / RAM
Downloads
Likes
Luminaura RP 13B128K / 26 GB190
Yarn Llama 2 13B 128K128K / 26 GB1956112
Agent Llama2 13B 80K80K / 26.4 GB260
Chat Llama2 13B 80K80K / 52.8 GB270
LongAlign 13B 64K64K / 26 GB1913
LongAlign 13B 64K Base64K / 26 GB313
Openbuddy Llama2 13B V15p1 64K64K / 26.1 GB194
Openbuddy Llama2 13b64k V1564K / 26.1 GB441
Yarn Llama 2 13B 64K64K / 26 GB162017
Airoboros L2 13B 2.1 YaRN 64K64K / 26 GB487
Note: green Score (e.g. "73.2") means that the model is better than pleisto/yuren-13b-chatml.

Rank the Yuren 13B Chatml Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 48225 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124