MiniCPM-2B-sft-fp32-llama-format: Parameters and Internals
Model Type
End-side LLM, Multimodal
Use Cases
Areas:
Research, Commercial applications
Applications:
Multimodal model deployment, Smartphone inference
Primary Use Cases:
End-side language model applications
Limitations:
Hallucination issues stemming from the limited model size; results that vary with prompt wording; inaccurate knowledge recall due to limited capacity
Additional Notes
MiniCPM models and weights are fully open for research use; limited commercial use is permitted after registration.
Training Details
Data Sources:
ShareGPT open-source corpus
Methodology:
Supervised fine-tuning (SFT) followed by direct preference optimization (DPO); see the sketch below
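For illustration only, here is a minimal PyTorch sketch of the standard DPO objective (Rafailov et al., 2023); this is not the authors' training code, and the function name and inputs (summed per-response token log-probabilities under the policy and a frozen reference model) are assumptions made for the example.

```python
# Illustrative sketch of the standard DPO loss, not the MiniCPM training code.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    # Log-ratio of policy to reference for preferred and dispreferred answers.
    chosen_ratio = policy_chosen_logps - ref_chosen_logps
    rejected_ratio = policy_rejected_logps - ref_rejected_logps
    # The loss widens the margin between the two ratios, scaled by beta.
    logits = beta * (chosen_ratio - rejected_ratio)
    return -F.logsigmoid(logits).mean()

# Example usage with dummy log-probs for a batch of two preference pairs:
pc = torch.tensor([-12.0, -9.5]); pr = torch.tensor([-14.0, -9.0])
rc = torch.tensor([-12.5, -10.0]); rr = torch.tensor([-13.0, -9.5])
print(dpo_loss(pc, pr, rc, rr))
```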
Input Output
Accepted Modalities:
Text, Multimodal inputs
Performance Tips:
Specify the model's data type explicitly in 'from_pretrained' (e.g. via torch_dtype); leaving it implicit can cause large numerical errors. See the loading sketch below.
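As a concrete illustration of this tip, a minimal loading sketch using the Hugging Face Transformers API; the prompt and generation settings are arbitrary examples, not values prescribed by the model card.

```python
# Minimal sketch: load the model with an explicit dtype, as the tip advises.
# Assumes the transformers and torch packages are installed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "openbmb/MiniCPM-2B-sft-fp32-llama-format"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# Passing torch_dtype explicitly avoids the large numerical errors that can
# occur when the data type is left implicit.
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)

inputs = tokenizer("Hello, MiniCPM!", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```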