| Field | Details |
| --- | --- |
| Model Type | |
| Use Cases: Areas | research, multilingual tasks, code tasks |
| Use Cases: Limitations | |
| Use Cases: Considerations | The model should be deployed with appropriate guardrails in environments that require moderated outputs. |
| Additional Notes | Trained jointly by Mistral AI and NVIDIA; a drop-in replacement for Mistral 7B. |
| Supported Languages | en (English), fr (French), de (German), es (Spanish), it (Italian), pt (Portuguese), ru (Russian), zh (Chinese), ja (Japanese) |
| Training Details: Methodology | Trained on a large proportion of multilingual and code data |
| Training Details: Context Length | |
| Training Details: Model Architecture | Transformer with 40 layers; model dim: 5,120; head dim: 128; hidden dim: 14,336; activation: SwiGLU; attention heads: 32; kv-heads: 8 (GQA); vocabulary size: 128k (2^17); rotary embeddings (theta = 1M) |
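The architecture hyperparameters above can be collected into a small configuration sketch. The class and field names here are illustrative, not from any official library; the values are those listed in the card, and the derived widths simply follow from its arithmetic:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ModelConfig:
    # Hyperparameters as listed in the model card (names are illustrative).
    n_layers: int = 40
    dim: int = 5_120            # model (embedding) dimension
    head_dim: int = 128         # dimension per attention head
    hidden_dim: int = 14_336    # feed-forward inner dimension (SwiGLU)
    n_heads: int = 32           # query heads
    n_kv_heads: int = 8         # key/value heads (grouped-query attention)
    vocab_size: int = 2**17     # 128k tokens
    rope_theta: float = 1_000_000.0  # rotary embedding base

cfg = ModelConfig()

# With GQA, each key/value head is shared by a group of query heads:
group_size = cfg.n_heads // cfg.n_kv_heads      # 32 // 8 = 4

# Projection widths implied by the listed numbers. Note that the query
# width (32 * 128 = 4,096) differs from the model dim (5,120).
q_width = cfg.n_heads * cfg.head_dim            # 4096
kv_width = cfg.n_kv_heads * cfg.head_dim        # 1024
```

Grouping 32 query heads over 8 kv-heads shrinks the key/value cache by a factor of 4 relative to full multi-head attention, which is the usual motivation for GQA.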
|
|
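The "rotary embeddings (theta = 1M)" entry refers to RoPE. A minimal sketch of the standard formulation follows; the function names are illustrative. Raising theta from the common 10,000 to 1,000,000 lowers the smallest rotation frequencies, which is a standard way to make positions remain distinguishable over longer contexts:

```python
import math

def rope_frequencies(head_dim: int, theta: float = 1_000_000.0) -> list[float]:
    # Standard RoPE: one frequency per pair of dimensions,
    # freq_i = theta ** (-2i / head_dim) for i in 0 .. head_dim/2 - 1.
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

def rotate_pair(x: float, y: float, pos: int, freq: float) -> tuple[float, float]:
    # Rotate one (x, y) dimension pair by the angle pos * freq.
    angle = pos * freq
    c, s = math.cos(angle), math.sin(angle)
    return (x * c - y * s, x * s + y * c)

# With head dim 128 (as in the table), each head gets 64 frequencies,
# decreasing geometrically from 1.0 down toward 1/theta.
freqs = rope_frequencies(128)
```

Because the rotation is applied per position to queries and keys, their dot product depends only on the relative distance between tokens, which is what makes RoPE a relative position encoding.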