Kaiju A 57B is an open-source language model by lodrick-the-lafted. Features: 57b LLM, VRAM: 114.7GB, Context: 195K, License: other, HF Score: 63.6, LLM Explorer Score: 0.12, Arc: 58.8, HellaSwag: 81, MMLU: 72.7, TruthfulQA: 52.3, WinoGrande: 78.8, GSM8K: 38.4.
| Model Type |
| ||||||
| Additional Notes |
| ||||||
| Input Output |
|
| LLM Name | Kaiju A 57B |
| Repository ๐ค | https://huggingface.co/lodrick-the-lafted/Kaiju-A-57B |
| Model Size | 57b |
| Required VRAM | 114.7 GB |
| Updated | 2026-04-05 |
| Maintainer | lodrick-the-lafted |
| Model Type | llama |
| Model Files | |
| Model Architecture | LlamaForCausalLM |
| License | other |
| Context Length | 200000 |
| Model Max Length | 200000 |
| Transformers Version | 4.35.0 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | <unk> |
| Vocabulary Size | 64002 |
| Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| DoubleBagel 57B V1.0 | 195K / 114.1 GB | 3 | 1 |
| Kyllene 57B V1.0 | 195K / 113.3 GB | 25 | 7 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐