| LLM Name | Nanbeige 16B Base 32K GGUF |
| Repository 🤗 | https://huggingface.co/TheBloke/Nanbeige-16B-Base-32K-GGUF |
| Model Name | Nanbeige 16B Base 32K |
| Model Creator | Nanbeige LLM Lab |
| Base Model(s) | |
| Model Size | 16B |
| Required VRAM | 6.6 GB |
| Updated | 2025-09-23 |
| Maintainer | TheBloke |
| Model Type | nanbeige |
| Model Files | |
| Supported Languages | en zh |
| GGUF Quantization | Yes |
| Quantization Type | gguf |
| Model Architecture | AutoModel |
| License | apache-2.0 |
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| Tinyllama Gguf 16B | 0K / 2.2 GB | 5 | 0 |
| Llama 3 16B Instruct V0.1 GGUF | 0K / 6.4 GB | 503 | 8 |
| Nanbeige 16B Chat 32K GGUF | 0K / 6.6 GB | 453 | 6 |
| Nanbeige 16B Base GGUF | 0K / 6.6 GB | 310 | 1 |
| Nanbeige 16B Chat GGUF | 0K / 6.6 GB | 284 | 1 |
| Ct2fast Codegen2 16B | 0K / 32.1 GB | 5 | 1 |
| Ct2fast Codegen 16B Mono | 0K / 32.1 GB | 6 | 2 |