| Model Type |
| |||||||||||||||||||
| Use Cases |
| |||||||||||||||||||
| Additional Notes |
| |||||||||||||||||||
| Supported Languages |
| |||||||||||||||||||
| Training Details |
| |||||||||||||||||||
| Input Output |
| |||||||||||||||||||
| Release Notes |
|
| LLM Name | Swallow MS 7B Instruct V0.1 |
| Repository ๐ค | https://huggingface.co/tokyotech-llm/Swallow-MS-7b-instruct-v0.1 |
| Model Size | 7b |
| Required VRAM | 14.6 GB |
| Updated | 2025-09-23 |
| Maintainer | tokyotech-llm |
| Model Type | mistral |
| Instruction-Based | Yes |
| Model Files | |
| Supported Languages | en ja |
| Model Architecture | MistralForCausalLM |
| License | apache-2.0 |
| Context Length | 4096 |
| Model Max Length | 4096 |
| Transformers Version | 4.39.1 |
| Tokenizer Class | LlamaTokenizer |
| Vocabulary Size | 42800 |
| Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| ...Nemo Instruct 2407 Abliterated | 1000K / 24.5 GB | 141 | 18 |
| SpydazWeb AI HumanAI RP | 512K / 14.4 GB | 17 | 1 |
| SpydazWeb AI HumanAI 002 | 512K / 14.4 GB | 18 | 1 |
| ...daz Web AI ChatML 512K Project | 512K / 14.5 GB | 12 | 0 |
| ... Summarize 64K QLoRANET Merged | 128K / 4.1 GB | 6 | 0 |
| ...1 Summarize 64K LoRANET Merged | 128K / 14.4 GB | 6 | 0 |
| Mistral 7B Instruct V0.2 | 32K / 14.4 GB | 3316095 | 2630 |
| Mistral 7B Instruct V0.1 | 32K / 14.4 GB | 153309 | 1797 |
| Seed X Instruct 7B | 32K / 15 GB | 1510 | 123 |
| Mixtral AI CyberCoder | 32K / 14.3 GB | 0 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐