Falcon2 5.5B Danish is an open-source language model by ssmits. Features: 11b LLM, VRAM: 10.9GB, Context: 8K, License: apache-2.0, Merged, LLM Explorer Score: 0.13.
research, foundation for further specialization, fine-tuning
Applications:
summarization, text generation, chatbot
Primary Use Cases:
Quantum computing concept explanation
Limitations:
limited generalization to non-model-specific languages
Considerations:
Requires adequate assessment and mitigation of risks for production use.
Additional Notes
The merge method ensures layer similarity with Danish text inputs.
Supported Languages
da (primary), en (limited), de (limited), es (limited), fr (limited), it (limited), pt (limited), pl (limited), nl (limited), ro (limited), cs (limited), sv (limited)
Training Details
Data Sources:
wikimedia/wikipedia Danish subset
Data Volume:
5T tokens
Methodology:
Continued pre-training with pruning strategy using PruneMe
Model Architecture:
Pruned model layers [0, 25] and [56, 59] undergo passthrough merge method
Responsible Ai Considerations
Fairness:
The model may carry stereotypes and biases encountered online.
Mitigation Strategies:
Recommend finetuning for specific tasks and precautions in production use.
Note: green Score (e.g. "73.2") means that the model is better than ssmits/Falcon2-5.5B-Danish.
Rank the Falcon2 5.5B Danish Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53999 in total.