Long document generation, Work summaries, PPT outlines, Reports, Emails
Limitations:
Possible unseen ethical issues, Need for compliance in use cases
Supported Languages
Chinese (High), English (High)
Training Details
Data Sources:
Web pages, Books, Official Media
Data Volume:
1.5 trillion tokens
Model Architecture:
Decoder-only architecture with improvements like Rotary Embedding for position encoding, use of SwiGLU activation function, and RMSNorm based Pre-Normalization.
Note: green Score (e.g. "73.2") means that the model is better than Tele-AI/telechat-7B-int8.
Rank the Telechat 7B Int8 Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53972 in total.