Pythia 70M Deduped Step44k 92bt by klosax

 ยป  All LLMs  ยป  klosax  ยป  Pythia 70M Deduped Step44k 92bt   URL Share it on

  Autotrain compatible   Endpoints compatible   Gpt neox   Pytorch   Region:us

Pythia 70M Deduped Step44k 92bt Benchmarks

Pythia 70M Deduped Step44k 92bt (klosax/pythia-70m-deduped-step44k-92bt)
๐ŸŒŸ Advertise your project ๐Ÿš€

Pythia 70M Deduped Step44k 92bt Parameters and Internals

LLM NamePythia 70M Deduped Step44k 92bt
Repository ๐Ÿค—https://huggingface.co/klosax/pythia-70m-deduped-step44k-92bt 
Model Size70m
Required VRAM0.2 GB
Updated2025-08-18
Maintainerklosax
Model Typegpt_neox
Model Files  0.2 GB
Model ArchitectureGPTNeoXForCausalLM
Licenseother
Context Length2048
Model Max Length2048
Transformers Version4.24.0
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50304
Torch Data Typefloat16

Best Alternatives to Pythia 70M Deduped Step44k 92bt

Best Alternatives
Context / RAM
Downloads
Likes
Pythia 70m Sft Hh2K / 0.1 GB70
Pythia 70M Multi2K / 0.3 GB50
Pythia Chinese 70M UDN2K / 0.2 GB180
Pythia 70m Sft2K / 0.3 GB70
...ythia 70M Wikipedia Paragraphs2K / 0.3 GB123
Pythia 70M2K / 0.2 GB18370073
Pythia 70M Deduped2K / 0.2 GB18809525
...a 70M Deduped Cleansharegpt En2K / 0.3 GB19440
...thia 70M Deduped Cleansharegpt2K / 0.3 GB19401
Chessdevilai2K / 0.3 GB50
Note: green Score (e.g. "73.2") means that the model is better than klosax/pythia-70m-deduped-step44k-92bt.

Rank the Pythia 70M Deduped Step44k 92bt Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50729 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124