Qwen2 VL 2B Instruct GPTQ Int8 is an open-source language model by Qwen. Features: 2b LLM, VRAM: 3.1GB, Context: 32K, License: apache-2.0, Quantized, Instruction-Based, LLM Explorer Score: 0.16.
Lack of Audio Support, Data timeliness until June 2023, Recognition of specific individuals or IPs, Limited handling of complex instructions, Insufficient counting accuracy, Weak spatial reasoning skills
Additional Notes
Available in different quantizations for broad hardware compatibility.
Supported Languages
en (>=0.8), zh (>=0.8), fr (>=0.8), es (>=0.8), de (>=0.8), ru (>=0.8), ja (>=0.8), ko (>=0.8), ar (>=0.8), vi (>=0.8)
Training Details
Data Sources:
MathVista, DocVQA, RealWorldQA, MTVQA
Methodology:
Instruction-tuning, GPTQ quantization
Model Architecture:
Naive Dynamic Resolution, Multimodal Rotary Position Embedding (M-ROPE)
Input Output
Input Format:
Message-based input with role specification
Accepted Modalities:
text, image, video
Output Format:
Textual descriptions or responses
Performance Tips:
Use flash_attention_2 for acceleration in multi-image and video scenarios
Note: green Score (e.g. "73.2") means that the model is better than Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8.
Rank the Qwen2 VL 2B Instruct GPTQ Int8 Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.