Swallow 7B Instruct Hf is an open-source language model by tokyotech-llm. Features: 7b LLM, VRAM: 13.7GB, Context: 4K, License: llama2, Instruction-Based, LLM Explorer Score: 0.13.
Models not tuned to ensure outputs align with human intent and safety considerations.
Additional Notes
The models are continually pre-trained and instruction-tuned, emphasizing Japanese language capabilities.
Supported Languages
supported_languages_list (Japanese, English), languages_details (The Swallow model has undergone continual pre-training with the addition of Japanese language data.)
Training Details
Data Sources:
Japanese Wikipedia, RefinedWeb, Swallow Corpus, The Pile
Methodology:
Supervised fine-tuning (SFT) and instruction tuning using Anthropic HH-RLHF, Databricks Dolly 15-k, and OpenAssistant Conversations Dataset.
Model Architecture:
Refer to LLaMA-2 technical report for details on the model architecture.
Input Output
Accepted Modalities:
text
Output Format:
Text
Performance Tips:
Model employs a tokenizer with a broadened vocabulary based on Japanese data, offering efficient text representation and faster inference.
Release Notes
Version:
0.1
Date:
April 26, 2024
Notes:
Release of Swallow-7b-instruct-v0.1, Swallow-13b-instruct-v0.1, and Swallow-70b-instruct-v0.1.
Version:
N/A
Date:
March 2, 2024
Notes:
Release of Swallow-7b-plus-hf with twice as many Japanese tokens as Swallow-7b-hf.
Version:
N/A
Date:
February 4, 2024
Notes:
Release of Swallow-13b-NVE-hf.
Version:
N/A
Date:
January 26, 2024
Notes:
Release of Swallow-7b-NVE-hf, Swallow-7b-NVE-instruct-hf, Swallow-70b-NVE-hf, Swallow-70b-NVE-instruct-hf.
Version:
N/A
Date:
December 19, 2023
Notes:
Release of Swallow-7b-hf, Swallow-7b-instruct-hf, Swallow-13b-hf, Swallow-13b-instruct-hf, Swallow-70b-hf, Swallow-70b-instruct-hf.
Note: green Score (e.g. "73.2") means that the model is better than tokyotech-llm/Swallow-7b-instruct-hf.
Rank the Swallow 7B Instruct Hf Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 51638 in total.