EleutherAI Polyglot Ko 12.8B 4bits is an open-source language model published by RichardErkhov. Features: 12.8B parameters, VRAM: 7.7GB, Context: 2K, License: apache-2.0, LLM Explorer Score: 0.13.
EleutherAI Polyglot Ko 12.8B 4bits Parameters and Internals
Model Type
autoregressive, language model
Use Cases
Areas:
Research, Commercial Applications
Applications:
Text generation, Language comprehension, Model evaluation
Primary Use Cases:
Next token prediction in Korean
Limitations:
The model may produce responses that are not factual or accurate, and it can produce offensive content.
Considerations:
Use with appropriate filtering mechanisms for sensitive content.
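One way to read the filtering advice above is a post-generation check that withholds any output containing a blocked term. The sketch below is only an illustration and is not something the model card specifies: `BLOCKED_TERMS`, `filter_output`, and the placeholder string are hypothetical names, and a real deployment would more likely rely on a trained classifier than on a word list.

```python
# Hypothetical blocklist; replace with terms (or a classifier) suited to your deployment.
BLOCKED_TERMS = {"badword", "욕설"}

def filter_output(text: str, blocked=BLOCKED_TERMS, placeholder: str = "[filtered]") -> str:
    """Return the placeholder if the generated text contains any blocked term."""
    lowered = text.lower()
    if any(term.lower() in lowered for term in blocked):
        return placeholder
    return text
```

Withholding the whole output, rather than cutting out single words, avoids returning text whose meaning has been changed by the filter.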
Supported Languages
ko (Full)
Training Details
Data Sources:
Korean blog posts, Korean news dataset, Modu corpus, Korean patent dataset, Korean Q&A dataset, KcBERT dataset, Korean fiction dataset, Korean online comments, Korean Wikipedia, Clova call, Naver sentiment movie corpus, Korean hate speech dataset, Open subtitles, AIHub various tasks datasets, Standard Korean language dictionary
Data Volume:
863 GB (1.2 TB before processing)
Methodology:
Trained on 167 billion tokens over 301,000 steps using the GPT-NeoX framework with a cross-entropy loss.
Context Length:
2048
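The token count, step count, and context length above imply a rough batch size. The arithmetic below is a back-of-the-envelope sketch that assumes every training sequence fills the full 2048-token context. The card does not state this, so treat the result as an estimate, not the configured batch size.

```python
TOTAL_TOKENS = 167_000_000_000  # tokens seen in training (from the card)
TRAIN_STEPS = 301_000           # training steps (from the card)
CONTEXT_LENGTH = 2048           # tokens per sequence (from the card)

# Average number of tokens consumed by one training step.
tokens_per_step = TOTAL_TOKENS / TRAIN_STEPS

# Assumption: every sequence is packed to the full context length.
sequences_per_step = tokens_per_step / CONTEXT_LENGTH

print(f"{tokens_per_step:,.0f} tokens per step")
print(f"{sequences_per_step:.0f} full-length sequences per step")
```

Under that assumption, each step covers roughly 555 thousand tokens, or about 271 full-length sequences.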
Hardware Used:
256 A100 GPUs
Model Architecture:
40 transformer layers, model dimension 5120, feedforward dimension 20480, 40 attention heads of dimension 128, Rotary Position Embedding (RoPE) applied to 64 dimensions of each head.
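The architecture figures above can be checked against the 12.8B parameter count. The sketch below counts only the large weight matrices in each transformer block (four attention projections and two feedforward projections). It deliberately ignores biases, layer norms, and the embedding tables, whose size depends on a vocabulary size not given here.

```python
N_LAYERS = 40
D_MODEL = 5120
D_FF = 20480
N_HEADS = 40
HEAD_DIM = 128

# The attention heads should tile the model dimension exactly.
assert N_HEADS * HEAD_DIM == D_MODEL

# Attention: query, key, value, and output projections, each d_model x d_model.
attn_params = 4 * D_MODEL * D_MODEL
# Feedforward: one up-projection and one down-projection.
ffn_params = 2 * D_MODEL * D_FF

per_layer = attn_params + ffn_params
total_block_params = N_LAYERS * per_layer

print(f"{per_layer:,} weights per layer")
print(f"{total_block_params / 1e9:.2f}B weights in transformer blocks")
```

The blocks account for about 12.58B weights, which is consistent with the stated 12.8B once the omitted terms are added.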
Responsible AI Considerations
Fairness:
Polyglot-Ko may produce socially unacceptable or offensive content.
Transparency:
Open-source release with citation information provided.
Accountability:
Human curation recommended to filter sensitive content.
Mitigation Strategies:
Masking of personally identifiable information (PII) in the pre-processing stage.
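PII masking during pre-processing can be pictured as pattern-based substitution. The card does not say which PII types were masked or how, so the sketch below is purely illustrative: the two regular expressions, the `mask_pii` helper, and the `[EMAIL]` / `[PHONE]` placeholders are all assumptions chosen for the example.

```python
import re

# Illustrative patterns only; the card does not list the rules actually used.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Korean mobile numbers such as 010-1234-5678, with optional hyphens.
PHONE_RE = re.compile(r"\b01[016789]-?\d{3,4}-?\d{4}\b")

def mask_pii(text: str) -> str:
    """Replace e-mail addresses and mobile numbers with placeholder tokens."""
    text = EMAIL_RE.sub("[EMAIL]", text)
    text = PHONE_RE.sub("[PHONE]", text)
    return text
```

For example, `mask_pii("문의: hong@example.com, 010-1234-5678")` returns `"문의: [EMAIL], [PHONE]"`.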
Input Output
Input Format:
Text prompt in Korean
Accepted Modalities:
text
Output Format:
Text generation
Performance Tips:
Ensure the hardware has enough memory for a 12.8B-parameter model; this listing gives 7.7GB of VRAM for the 4-bit version.
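The memory requirement can be sanity-checked from the parameter count and bit width. The sketch below assumes that "GB" in the listing means decimal gigabytes and that all 12.8B parameters are stored at 4 bits. In practice some tensors are usually kept at higher precision, so this is a lower bound on weight storage, not a measurement.

```python
N_PARAMS = 12.8e9        # parameter count from the listing
BITS_PER_WEIGHT = 4      # 4-bit quantization
LISTED_VRAM_GB = 7.7     # VRAM figure from the listing

# Storage for the weights alone, in decimal gigabytes.
weight_bytes = N_PARAMS * BITS_PER_WEIGHT / 8
weight_gb = weight_bytes / 1e9

# Whatever remains of the listed figure after the raw weights.
overhead_gb = LISTED_VRAM_GB - weight_gb

print(f"4-bit weights alone: {weight_gb:.1f} GB")
print(f"Remaining headroom in the listed figure: {overhead_gb:.1f} GB")
```

The weights alone come to about 6.4 GB. The remaining 1.3 GB or so in the listed figure presumably covers quantization metadata, higher-precision tensors, and runtime buffers, though the listing does not break this down.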