Mpt 7B Storywriter by mosaicml


Mpt 7B Storywriter is an open-source language model by mosaicml. Features: 7b LLM, VRAM: 13.3GB, License: apache-2.0, LLM Explorer Score: 0.15, Arc: 45.7, HellaSwag: 74.1, MMLU: 28.8.

Tags: Arxiv:2108.12409, Arxiv:2205.14135, Arxiv:2302.06675, Autotrain compatible, Composer, Custom code, Dataset:the pile books3, Llm-foundry, Mosaicml, Mpt, Pytorch, Region:us, Sharded


Mpt 7B Storywriter Parameters and Internals

Model Type: text generation
Use Cases:
  Areas: Fictional story generation
  Limitations: Can produce factually incorrect output; could generate lewd, biased or otherwise offensive outputs
Additional Notes: MPT-7B-StoryWriter-65k+ uses ALiBi to allow context length extrapolation beyond 65k tokens.
Training Details:
  Data Sources: the_pile_books3
  Methodology: finetuning
  Context Length: 65536
  Training Time: 2 days
  Hardware Used: 8 A100-80GB GPUs
  Model Architecture: Modified decoder-only transformer with FlashAttention and ALiBi
Responsible AI Considerations:
  Fairness: Model may produce biased outputs.
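
The note on ALiBi and context length extrapolation can be illustrated with a small sketch. It follows the general formulation from the ALiBi paper (Arxiv:2108.12409) for a power-of-two head count and is not taken from the MPT code base; the function names and the head count of 8 are assumptions chosen for illustration. The point is that the attention bias depends only on the distance between query and key, so it is defined for any position, including positions past 65536.

```python
from typing import List


def alibi_slopes(n_heads: int) -> List[float]:
    """Per-head slopes as a geometric sequence 2^(-8k/n), k = 1..n.

    Assumption: n_heads is a power of two (the simple case in the paper).
    """
    if n_heads <= 0 or n_heads & (n_heads - 1):
        raise ValueError("this sketch only handles power-of-two head counts")
    return [2.0 ** (-8.0 * k / n_heads) for k in range(1, n_heads + 1)]


def alibi_bias_row(slope: float, query_pos: int) -> List[float]:
    """Bias added to the attention logits of one query position.

    Causal attention: the query at query_pos sees keys 0..query_pos.
    Each key is penalised linearly by its distance from the query.
    """
    return [-slope * (query_pos - key_pos) for key_pos in range(query_pos + 1)]


slopes = alibi_slopes(8)           # hypothetical 8-head model
near = alibi_bias_row(slopes[0], 3)
# Nothing is looked up in a learned position table, so a position
# beyond the training context length works the same way.
far = alibi_bias_row(slopes[-1], 70_000)
print(slopes[:3], near, len(far), far[0])
```

Because no learned position embedding is involved, the only thing that changes at longer lengths is the size of the penalty on distant tokens. How well generation quality holds up at those lengths is a separate, empirical question that this sketch does not address.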
LLM Name: Mpt 7B Storywriter
Repository: https://huggingface.co/mosaicml/mpt-7b-storywriter
Model Size: 7b
Required VRAM: 13.3 GB
Updated: 2025-09-23
Maintainer: mosaicml
Model Type: mpt
Model Files: 9.9 GB (1-of-2), 3.4 GB (2-of-2)
Model Architecture: MPTForCausalLM
License: apache-2.0
Model Max Length: 65536
Transformers Version: 4.28.1
Tokenizer Class: GPTNeoXTokenizer
Vocabulary Size: 50432
Torch Data Type: bfloat16
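
The listed VRAM figure appears to equal the combined size of the two weight files, which the short calculation below checks. It is a rough sketch under stated assumptions: 1 GB is taken as 10^9 bytes, bfloat16 uses 2 bytes per parameter, and only the weights are counted (activations and the key/value cache for long contexts need additional memory). The variable names are illustrative.

```python
BYTES_PER_BF16_PARAM = 2           # bfloat16 is a 16-bit type

shard_sizes_gb = [9.9, 3.4]        # the two model files listed above
listed_vram_gb = 13.3              # "Required VRAM" as listed

# Sum of the shards, rounded to the precision the page uses.
total_weights_gb = round(sum(shard_sizes_gb), 1)

# Parameter count implied by the file size, in billions
# (assumes 1 GB = 1e9 bytes and weights stored purely as bfloat16).
implied_params_billion = total_weights_gb / BYTES_PER_BF16_PARAM

print(total_weights_gb, implied_params_billion)
```

Under these assumptions the files imply roughly 6.65 billion parameters, which is consistent with the "7b" size label. Treat the 13.3 GB figure as a lower bound for loading the weights, not as the memory needed to run a 65k-token context.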

Best Alternatives to Mpt 7B Storywriter

Model | Context / RAM | Downloads+Likes
Mpt 7B Chat | 0K / 13.3 GB | 80920518
Mpt 7B Assistant | 0K / 13.3 GB | 111
Mpt 7B | 0K / 13.3 GB | 184601173
Mpt 7B Instruct | 0K / 13.3 GB | 7946470
Mpt 7B Int8 Ov | 0K / 0 GB | 130
Mpt 7B | 0K / 26.5 GB | 25101
Shears Mpt 7B 50 Base | 0K / 13.3 GB | 72
Mpt 7B 8K | 0K / 13.3 GB | 191526
Mpt 7B 8K Chat | 0K / 13.3 GB | 194240
Mpt 7B 8K Instruct | 0K / 13.3 GB | 201227

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a