| LLM Name | MambaByte Books |
|---|---|
| Repository 🤗 | https://huggingface.co/JunxiongWang/MambaByte_Books |
| Required VRAM | 1.5 GB |
| Updated | 2025-09-23 |
| Maintainer | JunxiongWang |
| Model Architecture | AutoModel | 
| License | apache-2.0 | 
| Vocabulary Size | 256 | 
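
The 256-entry vocabulary means MambaByte operates directly on raw UTF-8 bytes rather than subword tokens, so no tokenizer is needed. Below is a minimal loading sketch, assuming the checkpoint resolves through the transformers `AutoModel` path named in the table; the `trust_remote_code=True` flag and the `.logits` output attribute are assumptions, not details confirmed by this card.

```python
import torch
from transformers import AutoModelForCausalLM

# Assumption: the checkpoint loads via transformers' auto classes and may
# ship custom Mamba model code on the Hub, hence trust_remote_code=True.
model = AutoModelForCausalLM.from_pretrained(
    "JunxiongWang/MambaByte_Books",  # repository from the table above
    trust_remote_code=True,
)
model.eval()

# With a 256-symbol vocabulary, inputs are raw byte values 0-255:
# encode text as UTF-8 and wrap it in a batch dimension.
text = "Once upon a time"
input_ids = torch.tensor([list(text.encode("utf-8"))])  # shape (1, seq_len)

with torch.no_grad():
    outputs = model(input_ids)

# Assumption: outputs expose .logits like standard causal LMs;
# the last position scores the next byte.
next_byte = int(outputs.logits[0, -1].argmax())
print(chr(next_byte))
```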
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| Distil Longformer Base 4096 | 4K / 0.4 GB | 11 | 0 | 
| Daedalus 1 | 1K / GB | 6 | 1 | 
| Tiny Random Detr | 1K / 0.2 GB | 21 | 0 | 
| Opengpt2 Pytorch Backward | 1K / 6 GB | 20 | 1 | 
| Opengpt2 Pytorch Forward | 1K / 6 GB | 8 | 1 | 
| Finsent Transformer | 0.5K / 0.4 GB | 6 | 1 | 
| Bert Chinese L 12 H 768 A 12 | 0.5K / 0.4 GB | 4 | 1 | 
| Simbert Chinese Base | 0.5K / 0.4 GB | 6 | 0 | 
| Simbert Chinese Tiny | 0.5K / 0 GB | 5 | 0 | 
| Bert Tiny | 0.5K / 0 GB | 9,237,367 | 129 | 