Yi 34B 200K is an open-source language model by 01-ai. Features: 34B LLM, VRAM: 68.9 GB, Context: 195K, License: apache-2.0, HF Score: 64, LLM Explorer Score: 0.24, ARC: 65.8, HellaSwag: 82.1, MMLU: 75.6, TruthfulQA: 42.6, WinoGrande: 82.9, GSM8K: 34.9.
Yi 34B 200K Benchmarks
Percentages (nn.n%) show how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
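The listed HF Score of 64 is consistent with an unweighted mean of the six benchmark scores given for this model. The sketch below checks that arithmetic. That the HF Score is computed this way is an assumption, not something this page states, and `mean_score` is a hypothetical helper name.

```python
# Benchmark scores as listed on this page for Yi 34B 200K.
scores = {
    "ARC": 65.8,
    "HellaSwag": 82.1,
    "MMLU": 75.6,
    "TruthfulQA": 42.6,
    "WinoGrande": 82.9,
    "GSM8K": 34.9,
}


def mean_score(values):
    """Unweighted arithmetic mean of benchmark scores (assumed aggregation)."""
    values = list(values)
    return sum(values) / len(values)


average = mean_score(scores.values())
print(f"Mean of six benchmarks: {average:.1f}")
```

The scores sum to 383.9, so the mean is about 63.98, which rounds to the listed 64.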
Yi 34B 200K Parameters and Internals
Model Type Chat model, Text generation
Use Cases
Areas: Chat applications, Creative content generation
Applications: Commercial applications, Research, Educational tools
Primary Use Cases: Chatbots, Virtual assistants, Story generation
Limitations: Potential for hallucination, May produce inconsistent outputs
Considerations: Adjust generation parameters for desired output qualities.
Additional Notes Yi models follow Llama's architecture but do not use Llama's weights; they were developed independently, with their own datasets and training infrastructure.
Supported Languages English (Fluent), Chinese (Fluent)
Training Details
Data Sources: Multilingual corpora
Data Volume: 3T tokens
Methodology: Transformer-based architecture
Context Length:
Training Time:
Hardware Used: NVIDIA A800 (80GB), NVIDIA RTX 4090
Model Architecture: Based on Llama's architecture
Responsible AI Considerations
Fairness: Addressed during model development.
Transparency: Standard Transformer architecture; detailed in tech report.
Accountability:
Mitigation Strategies: Use of Supervised Fine-Tuning for better accuracy.
Input Output
Input Format: Interactive prompt conversation
Accepted Modalities:
Output Format: Text responses or follow-ups
Performance Tips: Calibrate temperature, top_p, top_k settings for desired response diversity.
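To make the effect of temperature, top_k, and top_p concrete, here is a minimal pure-Python sketch of how these settings are commonly applied when picking the next token. It is illustrative only: `sample_token` and `toy_logits` are hypothetical names, the logits are made up, and real inference libraries may order or renormalize the filters differently.

```python
import math
import random


def sample_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=random):
    """Sample an index from raw logits using temperature, top-k, and top-p."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Temperature: divide the logits; lower values sharpen the distribution.
    scaled = [x / temperature for x in logits]
    # Softmax, subtracting the maximum for numerical stability.
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Rank candidate tokens from most to least probable.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    # Top-k: keep only the k most probable tokens (0 disables this filter).
    if top_k > 0:
        ranked = ranked[:top_k]
    # Top-p: keep the shortest prefix whose cumulative probability reaches top_p.
    # Simplification: probabilities are not renormalized after the top-k cut.
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    # Sample among the survivors; random.choices normalizes the weights.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]


toy_logits = [2.0, 1.0, 0.5, -1.0]  # made-up scores for a 4-token vocabulary
print(sample_token(toy_logits, temperature=0.7, top_k=3, top_p=0.9))
```

Lower temperature, smaller top_k, and smaller top_p all narrow the set of likely tokens, which makes output more repeatable. Raising them increases diversity at the cost of consistency.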
Release Notes
Version:
Date:
Notes: Initial open-source release of the chat model, supporting both 4-bit and 8-bit quantization.
Version:
Date:
Notes: Improved performance in coding, math, and reasoning with larger context capabilities.
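The 4-bit and 8-bit quantizations mentioned in the release notes mainly reduce the memory needed to hold the weights. The back-of-the-envelope sketch below estimates that footprint at different precisions. The parameter count of 34.4e9 is an assumption (the page only says "34b"), and `weight_memory_gb` is a hypothetical helper. The estimate covers weights only, not activations or the KV cache.

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights only, in GB (10^9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


N_PARAMS = 34.4e9  # assumed parameter count; not stated exactly on this page

for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gb(N_PARAMS, bits):.1f} GB")
```

Under this assumption the 16-bit estimate is about 68.8 GB, close to the 68.9 GB VRAM figure listed for the model. At long context lengths the KV cache can add substantially to this.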
Release v20241124