Smol Llama 101M GQA Python is an open-source language model by BEE-spoke-data. Features: 101m LLM, VRAM: 0.4GB, Context: 1K, License: apache-2.0, LLM Explorer Score: 0.1.
struggles with complex reasoning and planning tasks
Considerations:
Use with understanding that it may contain bugs and is best suited for educational or experimental purposes.
Additional Notes
The model is part of an experiment to explore code generation capabilities of smaller models. May create useful utilities but is not optimized for more complex tasks.
Supported Languages
en (basic level)
Training Details
Data Sources:
BEE-spoke-data/pypi_clean-deduped
Methodology:
general pre-trained checkpoint with additional Python-related tokens added to vocab
Training Time:
+1 epoch
Model Architecture:
similar to base model architecture with new tokens
Input Output
Input Format:
text prompts, specifically Python code snippets or descriptions
Accepted Modalities:
text
Output Format:
generated code snippets in Python or README markdown
Performance Tips:
May require specific configurations in tokenizer settings (e.g., use_fast=False).
Note: green Score (e.g. "73.2") means that the model is better than BEE-spoke-data/smol_llama-101M-GQA-python.
Rank the Smol Llama 101M GQA Python Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53972 in total.