Granite 3.0 2B Base is an open-source language model by ibm-granite. Features: 2b LLM, VRAM: 10.1GB, Context: 4K, License: apache-2.0, LLM Explorer Score: 0.23.
summarization, text classification, extraction, question-answering
Primary Use Cases:
baseline to create specialized models
Limitations:
Not undergone any safety alignment, may produce problematic outputs., Potential increased susceptibility to hallucination due to model size.
Considerations:
Community urged to use the model with ethical intentions.
Supported Languages
English (supported), German (supported), Spanish (supported), French (supported), Japanese (supported), Portuguese (supported), Arabic (supported), Czech (supported), Italian (supported), Korean (supported), Dutch (supported), Chinese (supported)
Training Details
Data Sources:
web, code, academic sources, books, math data, multilingual, instruction data
Data Volume:
12 trillion tokens for Stage 1 and 2 trillion tokens for Stage 2
Methodology:
Two-stage training strategy
Context Length:
4096
Hardware Used:
IBM's super computing cluster, Blue Vela, NVIDIA H100 GPUs
Model Architecture:
Decoder-only dense transformer architecture with GQA, RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embeddings
Responsible Ai Considerations
Fairness:
Involves awareness of bias and fairness.
Mitigation Strategies:
Ongoing research to address and mitigate issues.
Input Output
Input Format:
Tokenized input using AutoTokenizer
Accepted Modalities:
text
Output Format:
Decodes output tokens into text using AutoTokenizer
Performance Tips:
Use appropriate libraries and follow examples provided.
Note: green Score (e.g. "73.2") means that the model is better than ibm-granite/granite-3.0-2b-base.
Rank the Granite 3.0 2B Base Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.