Devstral Small 2505 Bf16 by mlx-community

 ยป  All LLMs  ยป  mlx-community  ยป  Devstral Small 2505 Bf16   URL Share it on

  Ar Base model:finetune:mistralai/... Base model:mistralai/devstral-...   Bn   Conversational   De   En   Es   Fa   Fr   Hi   Id   It   Ja   Ko   Mistral   Mlx   Ms   Ne   Pl   Pt   Region:us   Ro   Ru   Safetensors   Sharded   Sr   Sv   Tensorflow   Tr   Uk   Vi   Zh

Devstral Small 2505 Bf16 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Devstral Small 2505 Bf16 (mlx-community/Devstral-Small-2505-bf16)
๐ŸŒŸ Advertise your project ๐Ÿš€

Devstral Small 2505 Bf16 Parameters and Internals

LLM NameDevstral Small 2505 Bf16
Repository ๐Ÿค—https://huggingface.co/mlx-community/Devstral-Small-2505-bf16 
Base Model(s)  Devstral Small 2505   mistralai/Devstral-Small-2505
Required VRAM46.9 GB
Updated2025-07-25
Maintainermlx-community
Model Typemistral
Model Files  5.1 GB: 1-of-10   5.2 GB: 2-of-10   5.1 GB: 3-of-10   5.2 GB: 4-of-10   5.2 GB: 5-of-10   5.1 GB: 6-of-10   5.2 GB: 7-of-10   5.2 GB: 8-of-10   4.3 GB: 9-of-10   1.3 GB: 10-of-10
Supported Languagesen fr de es pt it ja ko ru zh ar fa id ms ne pl ro sr sv tr uk vi hi bn
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length131072
Model Max Length131072
Transformers Version4.51.3
Tokenizer ClassLlamaTokenizerFast
Vocabulary Size131072
Torch Data Typebfloat16

Quantized Models of the Devstral Small 2505 Bf16

Model
Likes
Downloads
VRAM
Devstral Small 2505 4bit DWQ437813 GB

Best Alternatives to Devstral Small 2505 Bf16

Best Alternatives
Context / RAM
Downloads
Likes
Krutrim 2 Instruct1000K / 49.3 GB5529
Ft V1 Violet1000K / 24.5 GB200
Tiny Random MistralForCausalLM128K / 0 GB36501
Winterreise M732K / 14.4 GB00
Frostwind V2.1 M732K / 14.4 GB00
...ydaz Web AI Reasoner BaseModel32K / 14.4 GB01
MistralLite32K / 14.4 GB13678432
MistralLite32K / 14.4 GB61777430
Snorkel Mistral PairRM DPO32K / 14.4 GB853107
Tess XS V1.3 Yarn 128K32K / 14.5 GB384413
Note: green Score (e.g. "73.2") means that the model is better than mlx-community/Devstral-Small-2505-bf16.

Rank the Devstral Small 2505 Bf16 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50068 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124