Qwen2 72B Instruct Bpw5.5 EXL2 is an open-source language model by blockblockblock. Features: 72b LLM, VRAM: 51.8GB, Context: 32K, License: other, Quantized, Instruction-Based, LLM Explorer Score: 0.14.
Qwen2 72B Instruct Bpw5.5 EXL2 Parameters and Internals
Model Type
text-generation, multimodal
Use Cases
Areas:
Research, Commercial applications
Supported Languages
English (high), Chinese (high)
Training Details
Methodology:
Pretrained with a large amount of data followed by supervised finetuning and direct preference optimization.
Context Length:
131072
Model Architecture:
Transformer architecture with SwiGLU activation, attention QKV bias, group query attention, and an improved tokenizer adaptive to multiple natural languages and codes.
Input Output
Input Format:
Structured prompts following a chat template
Accepted Modalities:
text
Output Format:
Generated text responses
Performance Tips:
Use rope_scaling for processing longer contexts when necessary
Note: green Score (e.g. "73.2") means that the model is better than blockblockblock/Qwen2-72B-Instruct-bpw5.5-exl2.
Rank the Qwen2 72B Instruct Bpw5.5 EXL2 Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52392 in total.