Audio analysis, Sound understanding, Music appreciation, Speech editing
Primary Use Cases:
Multimodal understanding, Sound analysis, Multi-turn dialogues, Multi-language support
Additional Notes
The model is designed to operate efficiently with diverse audio and text inputs, and aims to serve as a universal audio understanding model based on Alibaba Cloud's large model series.
Supported Languages
languages_supported (/ languages: zh, en)
Training Details
Methodology:
Multimodal and multi-task pre-training with a multi-task training framework.
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.