Qwen2 72B Instruct Mix Calibration 4B EXL2 is an open-source language model by Orion-zhen. Features: 72B-parameter LLM, VRAM: 38.8 GB, Context: 32K, License: other, Quantized, Instruction-Based, LLM Explorer Score: 0.14.
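As a rough sanity check on the VRAM figure above, the weight footprint of a quantized model can be estimated as parameters times bits per weight divided by 8. The sketch below assumes that "72B" means roughly 72e9 parameters and that "4B" in the model name means about 4.0 bits per weight; both are readings of the name, not values confirmed by the listing, and the helper name `weight_bytes` is hypothetical.

```python
def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in bytes."""
    return n_params * bits_per_weight / 8


# Assumed values: ~72e9 parameters at ~4.0 bits per weight.
est_gb = weight_bytes(72e9, 4.0) / 1e9
print(f"Estimated weight footprint: {est_gb:.1f} GB")
```

The estimate covers the weights only. The listed 38.8 GB is somewhat higher, which would be consistent with extra storage beyond a flat 4 bits per weight, though the listing does not break the figure down.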
Qwen2 72B Instruct Mix Calibration 4B EXL2 Parameters and Internals
Model Type
text-generation
Use Cases
Areas:
research, commercial applications
Supported Languages
en (High)
Training Details
Methodology:
Pretrained with a large amount of data, and post-trained with both supervised finetuning and direct preference optimization.
Context Length:
131072 tokens with YaRN scaling (32,768 tokens without it)
Model Architecture:
Transformer architecture with SwiGLU activation, attention QKV bias, group query attention, etc.
Input Output
Accepted Modalities:
text
Performance Tips:
To handle inputs longer than 32,768 tokens, use YaRN, a technique for improving the model's length extrapolation, so that performance holds up on long texts.
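A minimal sketch of how the YaRN scaling factor for this tip might be derived. It assumes the `rope_scaling` key layout used in Hugging Face-style `config.json` files for Qwen2 models; whether a given EXL2 loader reads these keys is an assumption to verify against that loader's documentation. The helper name `yarn_rope_scaling` is hypothetical.

```python
import json

NATIVE_CONTEXT = 32768    # context length without scaling, from the tip above
TARGET_CONTEXT = 131072   # context length listed under Training Details


def yarn_rope_scaling(native: int, target: int) -> dict:
    """Build a rope_scaling entry whose factor stretches native to target."""
    if target <= native:
        raise ValueError("YaRN scaling is only needed when target exceeds native")
    return {
        "type": "yarn",
        "factor": target / native,
        "original_max_position_embeddings": native,
    }


rope_scaling = yarn_rope_scaling(NATIVE_CONTEXT, TARGET_CONTEXT)
# Fragment that could be merged into a config.json (assumed layout).
print(json.dumps({"rope_scaling": rope_scaling}, indent=2))
```

With these two lengths the factor comes out to 4.0. Scaling of this kind is usually enabled only when long inputs are actually expected.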