Mistral 22B V0.1 by Vezora

 ยป  All LLMs  ยป  Vezora  ยป  Mistral 22B V0.1   URL Share it on

  Autotrain compatible   Conversational   Endpoints compatible   Mistral   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/Vezora/Mistral-22B-v0.1 

Mistral 22B V0.1 Benchmarks

๐ŸŒŸ Advertise your project ๐Ÿš€

Mistral 22B V0.1 Parameters and Internals

Model Type 
Dense Model
Use Cases 
Areas:
Experimentation, Mathematics
Applications:
Proficiency in mathematical tasks
Limitations:
Experimental model, Trained with limited data and time.
Additional Notes 
Stay tuned for the release of V.2 with enhanced features.
Training Details 
Data Sources:
500 regular human written Q/A, 500 tested python examples
Data Volume:
Small scale due to experimental model
Methodology:
First MOE to dense model conversion
Training Time:
Trained in less than an hour
Model Architecture:
22B parameter dense model
Release Notes 
Version:
V.01
Date:
April 11
Notes:
First MOE to dense model conversion; trained on a limited dataset in a short time, expected performance comparable to Llama 1.
LLM NameMistral 22B V0.1
Repository ๐Ÿค—https://huggingface.co/Vezora/Mistral-22B-v0.1 
Model Size22b
Required VRAM44.7 GB
Updated2025-06-09
MaintainerVezora
Model Typemistral
Model Files  4.9 GB: 1-of-9   5.0 GB: 2-of-9   5.0 GB: 3-of-9   4.9 GB: 4-of-9   5.0 GB: 5-of-9   5.0 GB: 6-of-9   4.9 GB: 7-of-9   5.0 GB: 8-of-9   5.0 GB: 9-of-9
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length65536
Model Max Length65536
Transformers Version4.39.3
Tokenizer ClassLlamaTokenizer
Vocabulary Size32000
Torch Data Typebfloat16
Mistral 22B V0.1 (Vezora/Mistral-22B-v0.1)

Quantized Models of the Mistral 22B V0.1

Model
Likes
Downloads
VRAM
Mistral 22B V0.1 AWQ11312 GB

Best Alternatives to Mistral 22B V0.1

Best Alternatives
Context / RAM
Downloads
Likes
MS Schisandra 22B V0.2128K / 44.7 GB119
...ntheon RP Pure 1.6.2 22B Small128K / 44.7 GB1231
MS Meadowlark 22B128K / 44.7 GB2314
...er The Final Transgression 22B128K / 44.7 GB123
...rker The Final Abomination 22B128K / 44.7 GB84
The Omega Directive M 22B V1.0128K / 44.7 GB202
...Darker The Final Directive 22B128K / 44.7 GB90
Retrograde Omega M 22B V1.0128K / 44.7 GB90
Beeper King 22B128K / 44.7 GB97
... V4x1.6.2RP Cydonia VXXX 22B 8128K / 44.7 GB75
Note: green Score (e.g. "73.2") means that the model is better than Vezora/Mistral-22B-v0.1.

Rank the Mistral 22B V0.1 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 48046 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124