Yi 34B is an open-source language model by 01-ai. Features: 34B-parameter LLM, VRAM: 68.9GB, Context: 4K, License: apache-2.0, HF Score: 69.4, LLM Explorer Score: 0.29, Arc: 91.9, HellaSwag: 82, MMLU: 76.3, TruthfulQA: 56.2, WinoGrande: 83, GSM8K: 67.9.
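The VRAM figure above can be sanity-checked with simple arithmetic. The sketch below is a rough estimate, not the method used to produce the listed number: it assumes the figure covers the weights only, stored at 16 bits per parameter, and it uses the nominal 34 billion parameter count (the exact count is slightly higher, which would explain the small gap to 68.9GB). The function name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Memory needed to hold the weights alone, in decimal gigabytes.

    Ignores activations, the KV cache, and framework overhead, so real
    usage during inference will be higher than this estimate.
    """
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 1e9


# Assumption: nominal parameter count taken from the "34B" in the model name.
NOMINAL_PARAMS = 34e9

if __name__ == "__main__":
    # 16-bit is the unquantized case; 8-bit and 4-bit are the quantized
    # variants mentioned in the release notes.
    for bits in (16, 8, 4):
        print(f"{bits}-bit weights: ~{weight_memory_gb(NOMINAL_PARAMS, bits):.1f} GB")
```

Under these assumptions the 16-bit estimate comes to about 68 GB, close to the listed value, while 8-bit and 4-bit weights need roughly half and a quarter of that.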
Yi 34B Benchmarks
Percentages (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yi 34B Parameters and Internals
Model Type: Chat model, Text generation
Use Cases
Areas: Chat applications, Creative content generation
Applications: Commercial applications, Research, Educational tools
Primary Use Cases: Chatbots, Virtual assistants, Story generation
Limitations: Potential for hallucination, May produce inconsistent outputs
Considerations: Adjust generation parameters for desired output qualities.
Additional Notes: The models do not directly use Llama's weights; unique datasets and training infrastructure emphasize Yi's independent development.
Supported Languages: English (Fluent), Chinese (Fluent)
Training Details
Data Sources: Multilingual corpora
Data Volume: 3T tokens
Methodology: Transformer-based architecture
Context Length: 4K
Training Time:
Hardware Used: NVIDIA A800 (80GB), 4090 GPU
Model Architecture: Based on Llama's architecture
Responsible AI Considerations
Fairness: Addressed during model development.
Transparency: Standard Transformer architecture; detailed in tech report.
Accountability:
Mitigation Strategies: Use of Supervised Fine-Tuning for better accuracy.
Input Output
Input Format: Interactive prompt conversation
Accepted Modalities:
Output Format: Text responses or follow-ups
Performance Tips: Calibrate temperature, top_p, top_k settings for desired response diversity.
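To make the three settings in the performance tip concrete, here is a minimal, library-free sketch of how temperature, top_k and top_p are commonly applied to a model's raw scores before a token is drawn. It illustrates the general technique only; it is not Yi's own decoding code, and the function names `candidate_tokens` and `sample_token` are hypothetical. Real inference libraries may apply the filters in a different order or with different edge-case handling.

```python
import math
import random


def candidate_tokens(logits, temperature=1.0, top_k=0, top_p=1.0):
    """Return the token indices that survive top-k and top-p filtering,
    paired with their probabilities after temperature scaling."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")

    # Temperature: values below 1 sharpen the distribution, above 1 flatten it.
    scaled = [x / temperature for x in logits]

    # Softmax, subtracting the maximum for numerical stability.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank token indices from most to least probable.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most probable tokens (0 disables the filter).
    if top_k > 0:
        ranked = ranked[:top_k]

    # Top-p: keep the shortest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append((i, probs[i]))
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    return kept


def sample_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=None):
    """Draw one token index from the filtered candidates."""
    rng = rng or random.Random()
    kept = candidate_tokens(logits, temperature, top_k, top_p)
    indices = [i for i, _ in kept]
    weights = [p for _, p in kept]  # random.choices renormalises the weights
    return rng.choices(indices, weights=weights, k=1)[0]
```

With the toy scores `[2.0, 1.0, 0.0]`, a low top_p or `top_k=1` leaves only the most likely token, so output is deterministic; raising the temperature flattens the probabilities and lets more tokens through the same top_p threshold, which is what produces more diverse responses.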
Release Notes
Version:
Date:
Notes: Initial open-source release of chat model, supporting both 4-bit and 8-bit quantizations.
Version:
Date:
Notes: Improved performance in coding, math, and reasoning with larger context capabilities.