Mpt Mini Shakespeare by jploski


Tags: Autotrain compatible, Custom code, Endpoints compatible, Generated from trainer, Mpt, Pytorch, Region:us, Tensorboard


Mpt Mini Shakespeare Parameters and Internals

Use Cases 
Primary Use Cases:
Aid in debugging a GGML port of mpt-7b-storywriter.
Additional Notes 
Trained with Transformers 4.28.0 and PyTorch 2.0.1+cu117.
Training Details 
Data Sources:
https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt
Methodology:
Trained from scratch using a single text file for both training and validation.
Model Architecture:
Configuration and code adapted from mosaicml/mpt-7b-storywriter, with changes to make it a very tiny model.
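
The single-file methodology above can be illustrated with a minimal sketch. It assumes character-level blocks instead of the GPT-NeoX tokenizer listed for this model, and the helper names and `block_size` are hypothetical, not taken from the author's training script. The point it shows is that both splits come from the same text, so a validation loss computed this way only measures how well the model fits that file.

```python
from typing import Dict, List


def chunk_text(text: str, block_size: int) -> List[str]:
    """Split text into consecutive blocks of block_size characters, dropping the short tail."""
    usable = len(text) - len(text) % block_size
    return [text[i:i + block_size] for i in range(0, usable, block_size)]


def make_splits(text: str, block_size: int = 8) -> Dict[str, List[str]]:
    """Build train and validation splits from the same text (hypothetical helper)."""
    blocks = chunk_text(text, block_size)
    # Both splits hold identical data, mirroring the single-file setup.
    return {"train": blocks, "validation": list(blocks)}


# Stand-in for the contents of input.txt; the real file is much longer.
sample = "First Citizen: Before we proceed any further, hear me speak."
splits = make_splits(sample, block_size=8)
```

For a debugging model like this one, reusing the training text for validation is a reasonable shortcut, since the goal is a reproducible tiny checkpoint rather than a measure of generalization.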
LLM Name: Mpt Mini Shakespeare
Repository: https://huggingface.co/jploski/mpt-mini-shakespeare
Required VRAM: 0 GB
Updated: 2025-09-23
Maintainer: jploski
Model Type: mpt
Model Files: 0.0 GB, 0.0 GB
Model Architecture: MPTForCausalLM
Transformers Version: 4.28.0
Tokenizer Class: GPTNeoXTokenizer
Vocabulary Size: 50432
Torch Data Type: float32
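
A minimal loading sketch based on the parameters above, assuming the `transformers` and `torch` packages are installed and the repository is reachable. The "Custom code" tag and the `MPTForCausalLM` architecture suggest the checkpoint ships its own modeling code, so `trust_remote_code=True` is assumed to be required. The function names are hypothetical, and the sketch has not been verified against this specific checkpoint.

```python
REPO_ID = "jploski/mpt-mini-shakespeare"  # repository listed on this page


def load_model(repo_id: str = REPO_ID):
    """Load tokenizer and model; needs transformers, torch, and network access."""
    # Imported lazily so that defining this function does not require the libraries.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # Assumption: the repository provides custom MPT code, so remote code
    # execution has to be allowed explicitly.
    model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 40) -> str:
    """Generate a continuation for prompt with the loaded model."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(inputs["input_ids"], max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("First Citizen:")` would download the checkpoint on first use. Review the repository's code before enabling `trust_remote_code`, since it runs Python from the model repository.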

Best Alternatives to Mpt Mini Shakespeare

Best Alternatives | Context / RAM | Downloads / Likes
Tiny Mpt Random Remote Code | 0K / 0 GB | 66170
WangchanLion7B | 0K / 29.8 GB | 138
Replit Code Instruct Glaive | 0K / 10.4 GB | 888
Results Sharded Bf16 5GB | 0K / 13.4 GB | 50
Replit Coder | 0K / 5.2 GB | 130
Gpt4all Mpt | 0K / 26.6 GB | 2810
PhoGPT 7B5 GGUF | 0K / 17 GB | 373


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124