Devstral Small 2507 4bit DWQ by mlx-community

 ยป  All LLMs  ยป  mlx-community  ยป  Devstral Small 2507 4bit DWQ   URL Share it on

  4-bit   4bit   Ar Base model:mlx-community/devst... Base model:quantized:mlx-commu...   Bn   Conversational   De   En   Es   Fa   Fr   Hi   Id   It   Ja   Ko   Mistral   Mlx   Ms   Ne   Pl   Pt   Quantized   Region:us   Ro   Ru   Safetensors   Sharded   Sr   Sv   Tensorflow   Tr   Uk   Vi   Zh

Devstral Small 2507 4bit DWQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Devstral Small 2507 4bit DWQ (mlx-community/Devstral-Small-2507-4bit-DWQ)
๐ŸŒŸ Advertise your project ๐Ÿš€

Devstral Small 2507 4bit DWQ Parameters and Internals

LLM NameDevstral Small 2507 4bit DWQ
Repository ๐Ÿค—https://huggingface.co/mlx-community/Devstral-Small-2507-4bit-DWQ 
Base Model(s)  Devstral Samll 2507 Bf16   mlx-community/Devstral-Samll-2507-bf16
Model Size23.6b
Required VRAM13.3 GB
Updated2025-07-19
Maintainermlx-community
Model Typemistral
Model Files  5.3 GB: 1-of-3   5.3 GB: 2-of-3   2.7 GB: 3-of-3
Supported Languagesen fr de es pt it ja ko ru zh ar fa id ms ne pl ro sr sv tr uk vi hi bn
Quantization Type4bit
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length131072
Model Max Length131072
Transformers Version4.53.1
Tokenizer ClassLlamaTokenizerFast
Padding Token<pad>
Vocabulary Size131072
Torch Data Typebfloat16

Best Alternatives to Devstral Small 2507 4bit DWQ

Best Alternatives
Context / RAM
Downloads
Likes
Magistral Small 2506 4bit DWQ32K / 13.3 GB3124
Devstral Samll 2507 Bf16128K / 46.9 GB500
Magistral Small 250640K / 47.3 GB24060
...stral Small 2501 Tensopolis V132K / 47.3 GB60
Magistral Small 2506 Bf1632K / 46.9 GB43610
XortronCriminalComputingConfig32K / 47.3 GB62833
Sandmisthink24BKaalpac32K / 47.3 GB320
...stralThinker V1.1 Reasoner 25632K / 47.3 GB130
Dans LN32K / 47.3 GB131
Dans LN 4432K / 47.3 GB110
Note: green Score (e.g. "73.2") means that the model is better than mlx-community/Devstral-Small-2507-4bit-DWQ.

Rank the Devstral Small 2507 4bit DWQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 49901 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124