Yuren 13B Chatml is an open-source language model by pleisto. Features: 13b LLM, VRAM: 26.1GB, Context: 4K, License: llama2, LLM Explorer Score: 0.13, Arc: 53.1, HellaSwag: 78, MMLU: 56.3, GSM8K: 28.1.
Yuren 13B Chatml Benchmarks
nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yuren 13B Chatml Parameters and Internals
Model Type
Use Cases
Areas: Information synthesis, Intelligent agents, Natural language understanding, SQL generation
Applications: Data synthesis, Information extraction, SQL generation, Structured data output
Primary Use Cases: Enterprise internal data processing, Building intelligent agents for business scenarios
Limitations: Model is not intended for direct public services, Unforeseen issues may arise due to model complexity
Considerations: Strongly recommend using within controlled environments with additional security measures.
Additional Notes Primarily designed for enterprise internal use rather than public environments.
Supported Languages zh (Advanced), en (Advanced)
Training Details
Data Sources: bigcode/the-stack, mc4, pleisto/wikipedia-cn-20230720-filtered, gsm8k, OpenAssistant/oasst1, b-mc2/sql-create-context, niv0, BAAI/COIG, wenhu/TheoremQA, zjunlp/KnowLM-IE
Methodology: Continuously trained based on Llama 2 with a focus on information synthesis and data-centric approaches.
Context Length:
Training Time:
Model Architecture: Extended Llama architecture with 13 billion parameters.
Responsible Ai Considerations
Mitigation Strategies: Using additional security measures such as input/output filtering, reviewing, or restricting is advised.
Input Output
Input Format: Prompts primarily in structured data or natural language queries for text generation models.
Accepted Modalities:
Output Format: SQL statements, structured data responses, standard text completion formats.
Performance Tips: Use in controlled environments with pre-validated input formats for optimal performance.
Release Notes
Version:
Date:
Notes: Initial release with training and performance optimizations.
Best Alternatives to Yuren 13B Chatml
Expand
Rank the Yuren 13B Chatml Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
Expand
Check out
Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a