Yi 6B is an open-source language model by 01-ai. Features: 6b LLM, VRAM: 12.1GB, Context: 4K, License: apache-2.0, HF Score: 54, LLM Explorer Score: 0.23, Arc: 83.8, HellaSwag: 73.1, MMLU: 64, TruthfulQA: 41.9, WinoGrande: 73.8, GSM8K: 39.9.
Yi 6B Benchmarks
nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yi 6B Parameters and Internals
Model Type
Use Cases
Areas: research, commercial applications, personal use
Primary Use Cases:
Limitations: May produce hallucinations, Non-determinism in re-generation, Cumulative error potential
Considerations: Adjust generation parameters for diverse responses
Additional Notes Yi is based on Llama architecture but not a derivative; independently trained.
Supported Languages English (high), Chinese (high)
Training Details
Data Sources: multilingual corpus, custom datasets developed by Yi
Data Volume:
Methodology: Supervised Fine-Tuning (SFT) for chat models
Context Length:
Training Time:
Hardware Used: NVIDIA A800, GPU environment
Model Architecture: Transformer-based, similar to Llama
Responsible Ai Considerations
Fairness:
Transparency: Open-source distribution under Apache 2.0
Accountability:
Mitigation Strategies: Uses compliance checking algorithms to maximize data compliance
Input Output
Input Format:
Accepted Modalities:
Output Format:
Performance Tips: Use appropriate generation settings (temperature, top_p) for task diversity
Release Notes
Version:
Date:
Notes: Improved coding, math, reasoning abilities
Quantized Models of the Yi 6B
Best Alternatives to Yi 6B
Expand
Rank the Yi 6B Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
Expand
Check out
Ag3ntum โ our secure, self-hosted AI agent for server management.
Release v20260328a