limited to pre-defined languages for high proficiency
Considerations:
Supports long-context processing suitable for extended content applications.
Additional Notes
Utilizes advanced capabilities like webpage browsing and tool execution.
Supported Languages
zh (high), en (high), ja (medium), ko (medium), de (medium), other_langs (support for 26 languages, including extended support for large context lengths (up to 1 million).)
Training Details
Data Sources:
N/A
Data Volume:
N/A
Methodology:
Pre-trained on a diverse set of tasks including semantic, mathematical, reasoning, code, and knowledge datasets.
Context Length:
128000
Training Time:
N/A
Hardware Used:
N/A
Model Architecture:
Designed to support advanced functions like web browsing, code execution, custom tool calls, and long-text reasoning.
Input Output
Input Format:
JSON-like structures for dialogue with role-content mapping.
Accepted Modalities:
text
Output Format:
Text
Performance Tips:
Ensure software dependencies are fully updated and compatible with the specified versions.
Release Notes
Version:
2024/08/12
Date:
2024/08/12
Notes:
Updated to use transformers>=4.44.0.
Version:
2024/07/24
Date:
2024/07/24
Notes:
Released latest technical insights related to long-text processing support up to 1M.
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.