Devstral Samll 2507 Bf16 by mlx-community

 ยป  All LLMs  ยป  mlx-community  ยป  Devstral Samll 2507 Bf16   URL Share it on

  Ar Base model:finetune:mistralai/... Base model:mistralai/devstral-...   Bn   Conversational   De   En   Es   Fa   Fr   Hi   Id   It   Ja   Ko   Mistral   Mlx   Ms   Ne   Pl   Pt   Region:us   Ro   Ru   Safetensors   Sharded   Sr   Sv   Tensorflow   Tr   Uk   Vi   Zh

Devstral Samll 2507 Bf16 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Devstral Samll 2507 Bf16 (mlx-community/Devstral-Samll-2507-bf16)
๐ŸŒŸ Advertise your project ๐Ÿš€

Devstral Samll 2507 Bf16 Parameters and Internals

LLM NameDevstral Samll 2507 Bf16
Repository ๐Ÿค—https://huggingface.co/mlx-community/Devstral-Samll-2507-bf16 
Base Model(s)  Devstral Small 2507   mistralai/Devstral-Small-2507
Model Size23.6b
Required VRAM46.9 GB
Updated2025-10-08
Maintainermlx-community
Model Typemistral
Model Files  5.1 GB: 1-of-10   5.2 GB: 2-of-10   5.1 GB: 3-of-10   5.2 GB: 4-of-10   5.2 GB: 5-of-10   5.1 GB: 6-of-10   5.2 GB: 7-of-10   5.2 GB: 8-of-10   4.3 GB: 9-of-10   1.3 GB: 10-of-10
Supported Languagesen fr de es pt it ja ko ru zh ar fa id ms ne pl ro sr sv tr uk vi hi bn
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length131072
Model Max Length131072
Transformers Version4.53.1
Tokenizer ClassLlamaTokenizerFast
Padding Token<pad>
Vocabulary Size131072
Torch Data Typebfloat16

Quantized Models of the Devstral Samll 2507 Bf16

Model
Likes
Downloads
VRAM
Devstral Small 2507 4bit DWQ1117713 GB

Best Alternatives to Devstral Samll 2507 Bf16

Best Alternatives
Context / RAM
Downloads
Likes
Chat KTO128K / 47.3 GB60
Magistral Small 250640K / 47.3 GB360
...stral Small 2501 Tensopolis V132K / 47.3 GB50
XortronCriminalComputingConfig32K / 47.3 GB141775
Magistral Small 2506 Bf1632K / 46.9 GB5710
Sandmisthink24BKaalpac32K / 47.3 GB320
CharGen V3 Beta 275 S032K / 47.3 GB60
...stralThinker V1.1 Reasoner 25632K / 47.3 GB60
Dans LN32K / 47.3 GB131
Dans LN 4432K / 47.3 GB110
Note: green Score (e.g. "73.2") means that the model is better than mlx-community/Devstral-Samll-2507-bf16.

Rank the Devstral Samll 2507 Bf16 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51544 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124