Model may sometimes make errors, produce misleading contents, or struggle to manage tasks that are not related to coding. It has undergone very limited testing. Additional safety testing should be performed before any real-world deployments.
Training Details
Data Volume:
0.7 billion high-quality, code-related tokens
Methodology:
Fine-tuning
Hardware Used:
DeepSpeed ZeRO 3, Flash Attention 2
Input Output
Input Format:
Alpaca instruction format (excluding the system prompt)
Note: green Score (e.g. "73.2") means that the model is better than TechxGenus/CodeGemma-7b.
Rank the CodeGemma 7B Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52473 in total.