ALMA 13B GPTQ is an open-source language model by TheBloke. Features: 13b LLM, VRAM: 7.3GB, Context: 4K, License: mit, Quantized, LLM Explorer Score: 0.1.
The model is based on a new translation model paradigm starting with fine-tuning on monolingual data and further optimizing using high-quality parallel data.
Supported Languages
Chinese (high), English (high)
Training Details
Data Sources:
monolingual data, human-written parallel data
Data Volume:
20B monolingual tokens
Methodology:
Full-weight fine-tuning
Model Architecture:
LLaMA-2-7B
Input Output
Input Format:
Chinese: {prompt}
Accepted Modalities:
text
Output Format:
English translation
Release Notes
Version:
ALMA-7B
Notes:
Full-weight Fine-tune LLaMA-2-7B on 20B monolingual tokens and further optimized on human-written parallel data.
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/ALMA-13B-GPTQ.
Rank the ALMA 13B GPTQ Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52473 in total.