Qwen1.5 72B Chat GPTQ is an open-source language model by LoneStriker. Features: 72b LLM, VRAM: 45.4GB, Context: 32K, License: other, Quantized, HF Score: 66, LLM Explorer Score: 0.25, ELO: 1232, Arc: 68.5, HellaSwag: 86.4, MMLU: 77.4, TruthfulQA: 63.9, WinoGrande: 79.1, GSM8K: 20.4.
Qwen1.5 72B Chat GPTQ Benchmarks
nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Qwen1.5 72B Chat GPTQ Parameters and Internals
Model Type text generation, chat model
Additional Notes The beta version does not include GQA and the mixture of SWA and full attention. DPO improves human preference but lowers benchmark evaluation.
Supported Languages en (multilingual capabilities include English)
Training Details
Methodology: Supervised finetuning and direct preference optimization (DPO)
Context Length:
Model Architecture: Transformer architecture with SwiGLU activation, attention QKV bias, group query attention, mixture of sliding window attention and full attention
Input Output
Accepted Modalities:
Performance Tips: Use provided hyper-parameters in 'generation_config.json' for optimal performance.
LLM Name Qwen1.5 72B Chat GPTQ Repository ๐ค https://huggingface.co/LoneStriker/Qwen1.5-72B-Chat-GPTQ Base Model(s) Qwen/Qwen1.5-72B-Chat Qwen/Qwen1.5-72B-Chat Model Size 72b Required VRAM 45.4 GB Updated 2026-04-02 Maintainer LoneStriker Model Type qwen2 Model Files 45.4 GB Supported Languages en GPTQ Quantization Yes Quantization Type gptq|4bit Model Architecture Qwen2ForCausalLM License other Context Length 32768 Model Max Length 32768 Transformers Version 4.37.1 Tokenizer Class Qwen2Tokenizer Padding Token <|endoftext|> Vocabulary Size 152064 Torch Data Type float16 Errors replace
Best Alternatives to Qwen1.5 72B Chat GPTQ
Expand
Rank the Qwen1.5 72B Chat GPTQ Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
Expand
Check out
Ag3ntum โ our secure, self-hosted AI agent for server management.
Release v20260328a