| Model Type | 
 | |||||||||||||||
| Use Cases | 
 | |||||||||||||||
| Additional Notes | 
 | |||||||||||||||
| Supported Languages | 
 | |||||||||||||||
| Training Details | 
 | |||||||||||||||
| Input Output | 
 | 
| LLM Name | Qwen 72B Chat Int4 | 
| Repository ๐ค | https://huggingface.co/Qwen/Qwen-72B-Chat-Int4 | 
| Model Size | 72b | 
| Required VRAM | 40.8 GB | 
| Updated | 2025-09-23 | 
| Maintainer | Qwen | 
| Model Type | qwen | 
| Model Files | |
| Supported Languages | zh en | 
| Model Architecture | QWenLMHeadModel | 
| License | other | 
| Context Length | 32768 | 
| Model Max Length | 32768 | 
| Transformers Version | 4.32.0 | 
| Tokenizer Class | QWenTokenizer | 
| Vocabulary Size | 152064 | 
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| Qwen 72B | 32K / 67.1 GB | 3605 | 358 | 
| Qwen 72B Chat | 32K / 67.1 GB | 688 | 156 | 
| Qwen 72B Chat Int8 | 32K / 77 GB | 35 | 17 | 
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐