| Training Details | |
| --- | --- |
| Data Sources | Korean blog posts, Korean news dataset, Modu Corpus, Korean patent dataset, Korean Q&A dataset, KcBERT dataset, Korean fiction dataset, Korean online comments, Korean Wikipedia, ClovaCall, Naver Sentiment Movie Corpus, Korean hate speech dataset, OpenSubtitles, AI Hub various-task datasets, Standard Korean Language Dictionary |
| Data Volume | 863 GB (1.2 TB before processing) |
| Methodology | Trained on 167 billion tokens over 301,000 steps using the GPT-NeoX framework with a cross-entropy loss (see the token-rate sketch below) |
| Context Length | |
| Hardware Used | |
| Model Architecture | 40 transformer layers; model dimension 5,120; feed-forward dimension 20,480; 40 attention heads of dimension 128; Rotary Position Embedding applied to 64 of the 128 head dimensions (see the parameter-count and RoPE sketches following the table) |
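
As a quick sanity check on the training figures in the Methodology row, the average number of tokens consumed per optimizer step follows directly from the two stated totals. The batch composition (sequences per step and sequence length) is not stated in the table, so only the token rate is derived here:

```python
# Average tokens per optimizer step, from the figures in the table above.
total_tokens = 167e9      # 167 billion training tokens
total_steps = 301_000     # optimizer steps

tokens_per_step = total_tokens / total_steps
print(f"{tokens_per_step:,.0f} tokens per step")  # ~554,817 tokens per step
```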
|
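The architecture row also supports a rough parameter-count estimate. The sketch below assumes standard GPT-style blocks (fused Q/K/V plus output projection for attention, a two-matrix feed-forward) and omits embeddings, biases, and layer norms, since the vocabulary size is not listed in the table; it is a back-of-the-envelope check, not the authors' accounting:

```python
# Rough transformer-block parameter count from the Model Architecture row.
n_layers, d_model, d_ff = 40, 5120, 20480

attn_params = 4 * d_model * d_model   # Q, K, V, and output projections
ffn_params = 2 * d_model * d_ff       # up- and down-projections
per_layer = attn_params + ffn_params
total = n_layers * per_layer

print(f"per layer: {per_layer/1e6:.1f}M, blocks total: {total/1e9:.2f}B")
# per layer: 314.6M, blocks total: 12.58B (embeddings excluded)
```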
|
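The "Rotary Position Embedding applied to 64 dimensions" entry refers to partial rotary embedding: only the first 64 of each head's 128 dimensions are rotated by position-dependent angles, while the rest pass through unchanged. The sketch below is a minimal illustration with hypothetical names; the exact pairing of rotated dimensions varies by implementation, and an interleaved-pair convention is assumed here:

```python
import numpy as np

def rotary_embedding(x, rotary_dim=64, base=10000):
    """Apply RoPE to the first `rotary_dim` dims of x.

    x: array of shape (seq_len, head_dim). The remaining
    head_dim - rotary_dim dimensions are passed through unchanged.
    """
    seq_len, head_dim = x.shape
    # One frequency per rotated dimension pair.
    inv_freq = 1.0 / (base ** (np.arange(0, rotary_dim, 2) / rotary_dim))
    angles = np.outer(np.arange(seq_len), inv_freq)   # (seq_len, rotary_dim/2)
    cos, sin = np.cos(angles), np.sin(angles)

    x_rot, x_pass = x[:, :rotary_dim], x[:, rotary_dim:]
    x1, x2 = x_rot[:, 0::2], x_rot[:, 1::2]           # interleaved pairs
    rotated = np.empty_like(x_rot)
    rotated[:, 0::2] = x1 * cos - x2 * sin
    rotated[:, 1::2] = x1 * sin + x2 * cos
    return np.concatenate([rotated, x_pass], axis=-1)

q = np.random.randn(16, 128)   # one head: 16 positions, head dim 128
q_rope = rotary_embedding(q)   # positions encoded in the first 64 dims only
```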