LLM Name | Qwen 7B Chat Int8 |
Repository 🤗 | https://huggingface.co/Qwen/Qwen-7B-Chat-Int8 |
Model Size | 7b |
Required VRAM | 9 GB |
Updated | 2025-08-18 |
Maintainer | Qwen |
Model Type | qwen |
Supported Languages | zh en |
Model Architecture | QWenLMHeadModel |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.32.0 |
Tokenizer Class | QWenTokenizer |
Vocabulary Size | 151936 |
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Qwen 7B Chat | 32K / 15.3 GB | 185571 | 782 |
Qwen 7B | 32K / 15.3 GB | 23806 | 384 |
Qwen 7B Chat Int4 | 32K / 5.8 GB | 1888 | 75 |
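
The memory figures listed for the three variants can be sanity-checked with simple arithmetic. The sketch below is illustrative only: it assumes "7b" means exactly 7 billion parameters, that the unquantized Qwen 7B Chat stores weights at 16 bits, and that the listed figures are in units of 10^9 bytes. None of these assumptions is stated on this page. The function `weight_memory_gb` and the `LISTED` dictionary are hypothetical names introduced for the example.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: "7b" is taken as exactly 7 billion parameters.
NOMINAL_PARAMS = 7e9

# (assumed bits per weight, memory figure listed on this page in GB)
LISTED = {
    "Qwen 7B Chat": (16, 15.3),      # 16-bit weights are an assumption
    "Qwen 7B Chat Int8": (8, 9.0),
    "Qwen 7B Chat Int4": (4, 5.8),
}

for name, (bits, listed_gb) in LISTED.items():
    estimate = weight_memory_gb(NOMINAL_PARAMS, bits)
    print(f"{name}: weights ~{estimate:.1f} GB, listed {listed_gb} GB")
```

Under these assumptions the weight-only estimates come out below the listed figures in every case. The gap would plausibly be accounted for by the true parameter count, unquantized layers, and runtime overhead, but this page does not break the figures down.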