TinyMistral 6x248M Instruct by M4-ai

 ยป  All LLMs  ยป  M4-ai  ยป  TinyMistral 6x248M Instruct   URL Share it on

  Autotrain compatible Base model:finetune:m4-ai/tiny... Base model:m4-ai/tinymistral-6... Dataset:locutusque/hercules-v1...   En   Endpoints compatible   Instruct   Mixtral   Moe   Region:us   Safetensors

TinyMistral 6x248M Instruct Benchmarks

TinyMistral 6x248M Instruct (M4-ai/TinyMistral-6x248M-Instruct)
๐ŸŒŸ Advertise your project ๐Ÿš€

TinyMistral 6x248M Instruct Parameters and Internals

Model Type 
Language Model (Mixture of Experts)
Use Cases 
Areas:
technical software development, multilingual text generation
Applications:
technical explanations, educational content, policy analysis, problem-solving
Limitations:
Potential biases or harmful content, Not for strict content moderation
Considerations:
Users should be aware of potential biases and exercise caution in sensitive applications.
Additional Notes 
The model is designed for developers and researchers.
Supported Languages 
en (proficient)
Training Details 
Data Sources:
Locutusque/hercules-v1.0
Methodology:
Fine-tuned on the hercules-v1.0 dataset leveraging MoE architecture through LazyMergekit framework.
Model Architecture:
Mixture of Experts (MoE) with several versions of the TinyMistral model.
LLM NameTinyMistral 6x248M Instruct
Repository ๐Ÿค—https://huggingface.co/M4-ai/TinyMistral-6x248M-Instruct 
Base Model(s)  TinyMistral 6x248M   M4-ai/TinyMistral-6x248M
Model Size1b
Required VRAM4 GB
Updated2025-09-23
MaintainerM4-ai
Model Typemixtral
Instruction-BasedYes
Model Files  4.0 GB
Supported Languagesen
Model ArchitectureMixtralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.2
Tokenizer ClassLlamaTokenizer
Padding Token<|bos|>
Vocabulary Size32005
Torch Data Typefloat32

Best Alternatives to TinyMistral 6x248M Instruct

Best Alternatives
Context / RAM
Downloads
Likes
TinyMistral 6x248M32K / 4 GB64514

Rank the TinyMistral 6x248M Instruct Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51534 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124