What are the hardware requirements for BabyMistral?

BabyMistral requires approximately 0.9 GB of VRAM and supports a context window of 8K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.
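
As a rough cross-check on the figure above, weight memory can be estimated as parameter count times bits per weight. The sketch below is only that arithmetic. The 1.6b parameter count and the bfloat16 data type come from this page; the 4.5 bits-per-weight value for a 4-bit GGUF file is an assumed ballpark, and KV cache, activations, and runtime overhead are ignored.

```python
# Rough weight-memory estimate: parameters x bits per weight.
# Ignores KV cache, activations, and framework overhead.

PARAMS = 1.6e9  # "1.6b" as listed on this page

# Bits per weight. The 4-bit entry is an assumed ballpark for a
# q4_k-style GGUF file, not a figure taken from this page.
BITS_PER_WEIGHT = {
    "bfloat16": 16.0,
    "q4 (assumed ~4.5 bits)": 4.5,
}


def weight_gb(params: float, bits: float) -> float:
    """Return the weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bits / 8 / 1e9


for name, bits in BITS_PER_WEIGHT.items():
    print(f"{name}: {weight_gb(PARAMS, bits):.1f} GB")
```

Under these assumptions the estimates come out near the 3.1 GB and 0.9 GB file sizes listed for this model, which is consistent with the 0.9 GB VRAM figure describing the 4-bit file rather than the bfloat16 weights.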

Who developed BabyMistral and how large is it?

BabyMistral is a model with 1.6b parameters, developed by OEvortex. It is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate BabyMistral?

BabyMistral is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.
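
One way to fetch the files is the `huggingface_hub` client. The following is a minimal sketch, not an official download procedure: `fetch_model` is a hypothetical helper name, and the `*.gguf` pattern assumes the quantized file in the repository uses that extension.

```python
# Sketch: fetch BabyMistral files from Hugging Face.
# Requires `pip install huggingface_hub`. Nothing is downloaded until
# fetch_model() is called.

REPO_ID = "OEvortex/BabyMistral"  # repository linked from this page


def fetch_model(gguf_only: bool = False) -> str:
    """Download the repository snapshot and return its local directory.

    fetch_model is a hypothetical helper written for this example.
    """
    from huggingface_hub import snapshot_download  # imported lazily

    # Restrict the download to GGUF files when only quantized weights
    # are wanted. The "*.gguf" pattern is an assumption about the repo.
    patterns = ["*.gguf"] if gguf_only else None
    return snapshot_download(repo_id=REPO_ID, allow_patterns=patterns)


# Example call (performs a network download):
#   local_dir = fetch_model(gguf_only=True)
```

Passing `gguf_only=True` avoids pulling the full-precision weights when only the quantized file is needed.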

BabyMistral by OEvortex — VRAM 0.9GB, 8K context

Name: BabyMistral
Author: OEvortex

BabyMistral is an open-source language model by OEvortex. Features: 1.6b parameters, VRAM: 0.9 GB, context: 8K tokens, license: apache-2.0, quantized variants available, LLM Explorer Score: 0.15.

Tags: conversational, en, endpoints compatible, gguf, mistral, q4, quantized, region:us, safetensors

Model Card on HF 🤗: https://huggingface.co/OEvortex/BabyMistral

BabyMistral Benchmarks

LLME Score: 0.14685

BabyMistral Parameters and Internals

Model Type: text-generation

Use Cases

Applications: text completion and generation, creative writing assistance, dialogue systems, question answering, language understanding tasks

Limitations: may struggle with very specialized or technical domains; lacks real-time knowledge beyond its training data; may generate plausible-sounding but incorrect information

Supported Languages: en (proficient)

Training Details

Data Sources: 1.5 trillion tokens
Data Volume: 1.5 trillion tokens
Methodology: Trained from scratch
Training Time: 70 days
Hardware Used: 4x NVIDIA A100 GPUs
Model Architecture: Based on Mistral
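
The training figures above imply an average throughput, which can be worked out directly. This is only arithmetic on the listed numbers and assumes all four GPUs ran continuously for the full 70 days, which the page does not state.

```python
# Implied average training throughput from the listed figures.
# Assumes continuous training on all GPUs for the whole period.

TOKENS = 1.5e12  # 1.5 trillion tokens
DAYS = 70        # listed training time
GPUS = 4         # 4x NVIDIA A100

seconds = DAYS * 24 * 60 * 60
total_tps = TOKENS / seconds
per_gpu_tps = total_tps / GPUS

print(f"{total_tps:,.0f} tokens/s overall")   # 248,016 tokens/s overall
print(f"{per_gpu_tps:,.0f} tokens/s per GPU") # 62,004 tokens/s per GPU
```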

Responsible AI Considerations

Fairness: The model may reproduce biases present in its training data.
Mitigation Strategies: Generated content should be reviewed for accuracy and appropriateness.

LLM Name	BabyMistral
Repository 🤗	https://huggingface.co/OEvortex/BabyMistral
Model Size	1.6b
Required VRAM	0.9 GB
Updated	2026-05-21
Maintainer	OEvortex
Model Type	mistral
Model Files	0.9 GB, 3.1 GB
Supported Languages	en
GGUF Quantization	Yes
Quantization Type	q4, gguf, q4_k
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	8192
Model Max Length	8192
Transformers Version	4.44.0
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
Vocabulary Size	32002
Torch Data Type	bfloat16
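
The values in the table above map onto a typical `transformers` loading call. The sketch below is one possible setup under stated assumptions, not the author's documented usage: `prompt_budget`, `load_model`, and `generate` are hypothetical helper names, and the prompt is passed as plain text because this page does not specify a chat template.

```python
# Sketch: load BabyMistral with Hugging Face transformers.
# Requires `pip install torch transformers`. Nothing is loaded until
# load_model() or generate() is called.

REPO_ID = "OEvortex/BabyMistral"
CONTEXT_LENGTH = 8192  # "Context Length" from the table above


def prompt_budget(max_new_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many prompt tokens fit once room for the reply is reserved."""
    if not 0 <= max_new_tokens <= context_length:
        raise ValueError("max_new_tokens must be between 0 and context_length")
    return context_length - max_new_tokens


def load_model():
    """Return (tokenizer, model) loaded in bfloat16, the listed torch dtype."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, torch_dtype=torch.bfloat16)
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a completion, truncating the prompt to fit the context window."""
    tokenizer, model = load_model()
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=prompt_budget(max_new_tokens),
    )
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Reserving `max_new_tokens` out of the 8192-token window keeps the prompt plus the reply within the listed context length. For example, `prompt_budget(128)` leaves 8064 tokens for the prompt.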

Original data from HuggingFace, OpenCompass and various public git repos.
