Yi 9B 200K is an open-source language model by 01-ai. Features: 9b LLM, VRAM: 17.7GB, Context: 256K, License: apache-2.0, HF Score: 61.9, LLM Explorer Score: 0.23, Arc: 58, HellaSwag: 78.6, MMLU: 70.3, TruthfulQA: 40.6, WinoGrande: 76.5, GSM8K: 47.6.
Yi 9B 200K Benchmarks
nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yi 9B 200K Parameters and Internals
Model Type Chat model, Text generation
Use Cases
Areas: Chat applications, Creative content generation
Applications: Commercial applications, Research, Educational tools
Primary Use Cases: Chatbots, Virtual assistants, Story generation
Limitations: Potential for hallucination, May produce inconsistent outputs
Considerations: Adjust generation parameters for desired output qualities.
Additional Notes Models do not directly use Llama's weights; unique datasets and training infrastructure emphasize Yi's independent development.
Supported Languages English (Fluent), Chinese (Fluent)
Training Details
Data Sources: Trainer Multilingual Corpora, 3T Tokens
Data Volume:
Methodology: Transformer-based architecture
Context Length:
Training Time:
Hardware Used: NVIDIA A800 (80GB), 4090 GPU
Model Architecture: Based on Llama's architecture
Responsible Ai Considerations
Fairness: Addressed during model development.
Transparency: Standard Transformer architecture; detailed in tech report.
Accountability:
Mitigation Strategies: Use of Supervised Fine-Tuning for better accuracy.
Input Output
Input Format: Interactive prompt conversation
Accepted Modalities:
Output Format: Text responses or follow-ups
Performance Tips: Calibrate temperature, top_p, top_k settings for desired response diversity.
Release Notes
Version:
Date:
Notes: Initial open-source release of chat model, supporting both 4-bit and 8-bit quantizations.
Version:
Date:
Notes: Improved performance in coding, math, and reasoning with larger context capabilities.
Quantized Models of the Yi 9B 200K
Best Alternatives to Yi 9B 200K
Note: green Score (e.g. "73.2 ") means that the model is better than 01-ai/Yi-9B-200K .
Expand
Rank the Yi 9B 200K Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
Expand
Check out
Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a