LLM Name | Sakura 13B LNovel V0.8 8bit |
Repository | https://huggingface.co/SakuraLLM/Sakura-13B-LNovel-v0_8-8bit |
Model Size | 13B |
Required VRAM | 9.1 GB |
Updated | 2025-07-01 |
Maintainer | SakuraLLM |
Model Type | baichuan |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq, 8bit, 4bit |
Model Architecture | BaichuanForCausalLM |
License | apache-2.0 |
Model Max Length | 4096 |
Transformers Version | 4.33.2 |
Tokenizer Class | BaichuanTokenizer |
Beginning of Sequence Token | <s> |
End of Sequence Token | </s> |
Unk Token | <unk> |
Vocabulary Size | 125696 |
Torch Data Type | float16 |
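The fields above are enough to sketch how the checkpoint might be loaded with Hugging Face Transformers. This is a minimal sketch, not the maintainer's documented procedure. It assumes that the repository ships the custom Baichuan tokenizer and model code (hence `trust_remote_code=True`), that its config carries a GPTQ `quantization_config` that Transformers can read, and that a GPTQ backend is installed. The helper names `load_sakura` and `fits_context` are hypothetical.

```python
# Values copied from the table above.
REPO_ID = "SakuraLLM/Sakura-13B-LNovel-v0_8-8bit"
MAX_LENGTH = 4096  # "Model Max Length"


def load_sakura(repo_id: str = REPO_ID):
    """Hypothetical helper: load the tokenizer and the GPTQ-quantized model."""
    # Imported lazily so the rest of this module works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: BaichuanTokenizer and BaichuanForCausalLM are custom classes
    # shipped inside the repository, so remote code must be trusted.
    tokenizer = AutoTokenizer.from_pretrained(
        repo_id, use_fast=False, trust_remote_code=True
    )
    # Assumption: the repo config exposes GPTQ settings that Transformers can read;
    # if it does not, a dedicated GPTQ loader would be needed instead.
    model = AutoModelForCausalLM.from_pretrained(
        repo_id, device_map="auto", trust_remote_code=True
    )
    return tokenizer, model


def fits_context(n_prompt_tokens: int, n_new_tokens: int,
                 max_length: int = MAX_LENGTH) -> bool:
    """Return True if prompt plus generated tokens stay within the context limit."""
    return n_prompt_tokens + n_new_tokens <= max_length
```

`fits_context` can be called before generation to confirm that a tokenized prompt plus the requested `max_new_tokens` stays inside the 4096-token limit listed above.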
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Sakura 13B LNovel V0.8 4bit | 0K / 9.1 GB | 58 | 2 |
Sakura 13B LNovel V0.8 3bit | 0K / 7.5 GB | 34 | 1 |
Baichuan2 13B Chat GPTQ | 0K / 9.1 GB | 40 | 21 |
Baichuan2 13B Chat GPTQ Int4 | 0K / 9.1 GB | 28 | 2 |
...aichuan2 13B Chat Gptq 32g Act | 0K / 9.9 GB | 17 | 1 |
Baichuan 13B Instruction GPTQ | 0K / 7.9 GB | 18 | 4 |
Baichuan 13B Chat 8bit | 0K / 14.1 GB | 23 | 9 |
Tiny Random Baichuan2 13B | 0K / 0.1 GB | 97150 | 0 |
Baichuan2 13B Chat | 0K / 27.8 GB | 7232 | 427 |
Baichuan 13B Chat | 0K / 26.5 GB | 4801 | 631 |