Mixtral Instruct AWQ by casperhansen


Tags: 4-bit, Autotrain compatible, AWQ, Conversational, Endpoints compatible, Instruct, Mixtral, Quantized, Region:us, Safetensors, Sharded, Tensorflow


Mixtral Instruct AWQ Parameters and Internals

Additional Notes 
This is a working AWQ-quantized version of Mixtral Instruct. As of 11-02-2024, this repository is the suggested one to use, because another version does not work.
LLM Name: Mixtral Instruct AWQ
Repository: https://huggingface.co/casperhansen/mixtral-instruct-awq
Model Size: 6.5b
Required VRAM: 24.7 GB
Updated: 2025-08-19
Maintainer: casperhansen
Model Type: mixtral
Instruction-Based: Yes
Model Files: 10.0 GB (1-of-3), 10.0 GB (2-of-3), 4.7 GB (3-of-3)
AWQ Quantization: Yes
Quantization Type: awq
Model Architecture: MixtralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.36.2
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 32000
Torch Data Type: float16
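
The three shard sizes listed under Model Files add up to the Required VRAM figure (10.0 + 10.0 + 4.7 = 24.7 GB). The sketch below reproduces that sum and checks it against a GPU's memory. It assumes the VRAM figure covers the weights only; the `headroom_gb` allowance for KV cache and activations is a hypothetical value chosen for illustration, not something stated in the listing.

```python
# Shard sizes in GB, copied from the "Model Files" row of the listing.
SHARD_SIZES_GB = [10.0, 10.0, 4.7]


def total_weights_gb(shard_sizes_gb):
    """Sum the shard sizes, rounded to one decimal place like the listing."""
    return round(sum(shard_sizes_gb), 1)


def fits_in_vram(shard_sizes_gb, available_gb, headroom_gb=2.0):
    """Return True if weights plus headroom fit into available_gb.

    headroom_gb is a hypothetical allowance for KV cache and activations;
    the real overhead depends on batch size and sequence length.
    """
    return total_weights_gb(shard_sizes_gb) + headroom_gb <= available_gb


if __name__ == "__main__":
    print("Weights:", total_weights_gb(SHARD_SIZES_GB), "GB")
    for gpu_gb in (24, 48, 80):
        print(gpu_gb, "GB GPU fits:", fits_in_vram(SHARD_SIZES_GB, gpu_gb))
```

Under these assumptions a single 24 GB card falls short, so the weights would need a larger GPU or to be split across several devices.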

Best Alternatives to Mixtral Instruct AWQ

Best Alternatives   Context / RAM   Downloads and Likes (digits run together in the source)
Dolphin 2.7 Mixtral 8x7b AWQ   32K / 24.7 GB   445923
Mixtral 8x7B Instruct V0.1 AWQ   32K / 24.7 GB   50
Mixtral 8x7B Instruct V0.1 AWQ   32K / 24.7 GB   286357
...xtral Instruct AWQ Clone Dec23   32K / 24.7 GB   50
...ixtral Instruct 8x7b Zloss AWQ   32K / 24.7 GB   62
...0.1 LimaRP ZLoss DARE TIES AWQ   32K / 24.7 GB   63
...Instruct V0.1 LimaRP ZLoss AWQ   32K / 24.7 GB   51
Dolphin 2.6 Mixtral 8x7b AWQ   32K / 24.7 GB   4412
...1 Mixtral 8x7b Instruct V3 AWQ   32K / 24.7 GB   61
...utLM Mixtral 8x7B Instruct AWQ   32K / 24.7 GB   102

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124