Megatron Gpt2 345M Evol Instruct V2 by KnutJaegersberg

 ยป  All LLMs  ยป  KnutJaegersberg  ยป  Megatron Gpt2 345M Evol Instruct V2   URL Share it on

Megatron Gpt2 345M Evol Instruct V2 is an open-source language model by KnutJaegersberg. Features: 345m LLM, VRAM: 1.4GB, License: cc-by-nc-4.0, Instruction-Based, HF Score: 30.3, LLM Explorer Score: 0.16, Arc: 26.4, HellaSwag: 38.4, MMLU: 23.6, TruthfulQA: 41.2, WinoGrande: 52.3.

  Autotrain compatible Dataset:knutjaegersberg/wizard...   Endpoints compatible   Gpt2   Instruct   Pytorch   Region:us   Safetensors

Megatron Gpt2 345M Evol Instruct V2 Benchmarks

Megatron Gpt2 345M Evol Instruct V2 (KnutJaegersberg/megatron-gpt2-345m-evol_instruct_v2)
๐ŸŒŸ Advertise your project ๐Ÿš€

Megatron Gpt2 345M Evol Instruct V2 Parameters and Internals

LLM NameMegatron Gpt2 345M Evol Instruct V2
Repository ๐Ÿค—https://huggingface.co/KnutJaegersberg/megatron-gpt2-345m-evol_instruct_v2 
Model Size345m
Required VRAM1.4 GB
Updated2025-09-23
MaintainerKnutJaegersberg
Model Typegpt2
Instruction-BasedYes
Model Files  1.4 GB   1.4 GB   0.0 GB
Model ArchitectureGPT2LMHeadModel
Licensecc-by-nc-4.0
Model Max Length1024
Transformers Version4.32.0.dev0
Tokenizer ClassGPT2Tokenizer
Vocabulary Size50257
Torch Data Typefloat32
Activation Functiongelu_new

Rank the Megatron Gpt2 345M Evol Instruct V2 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51631 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum โ€” our secure, self-hosted AI agent for server management.
Release v20241124