Firefly Mixtral 8x7b by YeungNLP

 ยป  All LLMs  ยป  YeungNLP  ยป  Firefly Mixtral 8x7b   URL Share it on

  Autotrain compatible   En   Endpoints compatible   Mixtral   Moe   Region:us   Safetensors   Sharded   Tensorflow

Firefly Mixtral 8x7b Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Firefly Mixtral 8x7b (YeungNLP/firefly-mixtral-8x7b)
๐ŸŒŸ Advertise your project ๐Ÿš€

Firefly Mixtral 8x7b Parameters and Internals

Model Type 
text generation
Supported Languages 
en (fluent)
Training Details 
Data Sources:
ultrachat
Data Volume:
48k data
Methodology:
Fine-tuned using Firefly
Input Output 
Input Format:
Tokenized input with [INST] and [/INST] markers
Accepted Modalities:
text
Output Format:
Text response
LLM NameFirefly Mixtral 8x7b
Repository ๐Ÿค—https://huggingface.co/YeungNLP/firefly-mixtral-8x7b 
Model Size46.7b
Required VRAM93.6 GB
Updated2025-08-25
MaintainerYeungNLP
Model Typemixtral
Model Files  4.9 GB: 1-of-19   5.0 GB: 2-of-19   5.0 GB: 3-of-19   4.9 GB: 4-of-19   5.0 GB: 5-of-19   5.0 GB: 6-of-19   4.9 GB: 7-of-19   5.0 GB: 8-of-19   5.0 GB: 9-of-19   4.9 GB: 10-of-19   5.0 GB: 11-of-19   5.0 GB: 12-of-19   5.0 GB: 13-of-19   4.9 GB: 14-of-19   5.0 GB: 15-of-19   5.0 GB: 16-of-19   4.9 GB: 17-of-19   5.0 GB: 18-of-19   4.2 GB: 19-of-19
Supported Languagesen
Model ArchitectureMixtralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.1
Tokenizer ClassLlamaTokenizer
Vocabulary Size32000
Torch Data Typefloat16

Quantized Models of the Firefly Mixtral 8x7b

Model
Likes
Downloads
VRAM
Firefly Mixtral 8x7b GGUF1010215 GB
Firefly Mixtral 8x7b AWQ2624 GB
Firefly Mixtral 8x7b GPTQ31323 GB

Best Alternatives to Firefly Mixtral 8x7b

Best Alternatives
Context / RAM
Downloads
Likes
Mixtral 8x7B Instruct V0.132K / 93.6 GB3319994536
Nous Hermes 2 Mixtral 8x7B DPO32K / 93.6 GB11406448
Mixtral 8x7B V0.132K / 93.6 GB571771742
GritLM 8x7B KTO32K / 93.6 GB99693
Sensualize Mixtral Bf1632K / 93.6 GB00
Skadi Mixtral V132K / 93.5 GB00
Franziska Mixtral V132K / 93.5 GB00
Typhon Mixtral V132K / 93.4 GB00
Smaug Mixtral V0.132K / 187.7 GB995012
Mixtral 8x7B Instruct V0.1 FP832K / 47.1 GB24600
Note: green Score (e.g. "73.2") means that the model is better than YeungNLP/firefly-mixtral-8x7b.

Rank the Firefly Mixtral 8x7b Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50877 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124