CLEX Mixtral 8x7B Chat 32K by DAMO-NLP-SG

 ยป  All LLMs  ยป  DAMO-NLP-SG  ยป  CLEX Mixtral 8x7B Chat 32K   URL Share it on

  Arxiv:2310.16450   Autotrain compatible   Conversational   Custom code Dataset:damo-nlp-sg/longcorpus...   Endpoints compatible   Mixtral   Moe   Region:us   Safetensors   Sharded   Tensorflow

CLEX Mixtral 8x7B Chat 32K Benchmarks

CLEX Mixtral 8x7B Chat 32K (DAMO-NLP-SG/CLEX-Mixtral-8x7B-Chat-32K)
๐ŸŒŸ Advertise your project ๐Ÿš€

CLEX Mixtral 8x7B Chat 32K Parameters and Internals

Model Type 
Large Language Model, Chatbot
Additional Notes 
The CLEX model provides continuous length extrapolation capabilities without recurrent memory caching or sparse attention requirements. It efficiently extends context window size.
Training Details 
Data Sources:
UltraChat 200k
Methodology:
Continuous Length Extrapolation
Context Length:
32000
LLM NameCLEX Mixtral 8x7B Chat 32K
Repository ๐Ÿค—https://huggingface.co/DAMO-NLP-SG/CLEX-Mixtral-8x7B-Chat-32K 
Model Size46.7b
Required VRAM93.6 GB
Updated2025-09-23
MaintainerDAMO-NLP-SG
Model Typemixtral
Model Files  4.9 GB: 1-of-19   5.0 GB: 2-of-19   5.0 GB: 3-of-19   4.9 GB: 4-of-19   5.0 GB: 5-of-19   5.0 GB: 6-of-19   4.9 GB: 7-of-19   5.0 GB: 8-of-19   5.0 GB: 9-of-19   4.9 GB: 10-of-19   5.0 GB: 11-of-19   5.0 GB: 12-of-19   5.0 GB: 13-of-19   4.9 GB: 14-of-19   5.0 GB: 15-of-19   5.0 GB: 16-of-19   4.9 GB: 17-of-19   5.0 GB: 18-of-19   4.2 GB: 19-of-19
Model ArchitectureMixtralForCausalLM
Licensemit
Context Length32768
Model Max Length32768
Transformers Version4.36.2
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to CLEX Mixtral 8x7B Chat 32K

Best Alternatives
Context / RAM
Downloads
Likes
Mixtral 8x7B Instruct V0.132K / 93.6 GB2956784562
Nous Hermes 2 Mixtral 8x7B DPO32K / 93.6 GB11482450
Mixtral 8x7B V0.132K / 93.6 GB422481755
GritLM 8x7B KTO32K / 93.6 GB97143
Sensualize Mixtral Bf1632K / 93.6 GB00
Skadi Mixtral V132K / 93.5 GB00
Franziska Mixtral V132K / 93.5 GB00
Typhon Mixtral V132K / 93.4 GB00
Smaug Mixtral V0.132K / 187.7 GB971212
NatureLM 8x7B32K / 0.3 GB10315
Note: green Score (e.g. "73.2") means that the model is better than DAMO-NLP-SG/CLEX-Mixtral-8x7B-Chat-32K.

Rank the CLEX Mixtral 8x7B Chat 32K Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51535 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124