Polyglot-Ko may not always return the most factual or accurate response.
Considerations:
A human-curation or automated filtering mechanism is recommended to screen generated output for sensitive content.
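For illustration, a minimal Python sketch of one possible post-generation filtering step; the blocklist entries and function names are hypothetical placeholders, and a production deployment would pair something like this with human review or a trained safety classifier.

```python
# Minimal post-generation keyword filter. BLOCKLIST entries and the
# function names are hypothetical placeholders for illustration only.
BLOCKLIST = {"blocked_term_1", "blocked_term_2"}

def is_acceptable(text: str) -> bool:
    """Return False if the generated text contains a blocked term."""
    lowered = text.lower()
    return not any(term in lowered for term in BLOCKLIST)

def filter_generations(candidates: list[str]) -> list[str]:
    """Keep only generations that pass the keyword check."""
    return [text for text in candidates if is_acceptable(text)]
```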
Additional Notes
Polyglot-Ko may produce socially unacceptable or offensive content.
Supported Languages
Korean (high)
Training Details
Data Sources:
Korean blog posts, Korean news dataset, Modu corpus, Korean patent dataset, Korean Q&A dataset, KcBERT dataset, Korean fiction dataset, Korean online comments, Korean Wikipedia, ClovaCall, Naver sentiment movie corpus, Korean hate speech dataset, OpenSubtitles, AIHub various-task datasets, Standard Korean Language Dictionary
Data Volume:
863 GB (1.2 TB before processing)
Methodology:
Trained with a cross-entropy loss to maximize the likelihood of predicting the next token, using the EleutherAI GPT-NeoX framework.
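As an illustration of this objective, here is a minimal PyTorch sketch of next-token cross-entropy loss; `model` is a stand-in for any causal language model that returns per-position vocabulary logits, not the actual GPT-NeoX training loop.

```python
import torch
import torch.nn.functional as F

def language_modeling_loss(model, input_ids: torch.Tensor) -> torch.Tensor:
    """Next-token cross-entropy over a batch of token ID sequences."""
    logits = model(input_ids)                      # (batch, seq_len, vocab)
    shift_logits = logits[:, :-1, :].contiguous()  # position t predicts t+1
    shift_labels = input_ids[:, 1:].contiguous()
    return F.cross_entropy(
        shift_logits.view(-1, shift_logits.size(-1)),
        shift_labels.view(-1),
    )
```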
Context Length:
2,048 tokens
Training Steps:
301,000
Hardware Used:
256 A100 GPUs
Model Architecture:
40 transformer layers, model dimension 5,120, feedforward dimension 20,480, 40 attention heads with head dimension 128, and Rotary Position Embedding (RoPE) applied to 64 dimensions of each head. Tokenizer vocabulary of 30,003.
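For reference, a short sketch of loading the published checkpoint with the Hugging Face transformers library; the dtype and generation settings are assumptions to adjust for available hardware (a 12.8B-parameter model needs roughly 26 GB of memory in fp16).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/polyglot-ko-12.8b")
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/polyglot-ko-12.8b",
    torch_dtype="auto",  # assumption: infer dtype from the checkpoint
)

prompt = "한국어로 자기소개를 해 주세요."  # "Please introduce yourself in Korean."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```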