GPT Sw3 1.3B Instruct by AI-Sweden-Models

 »  All LLMs  »  AI-Sweden-Models  »  GPT Sw3 1.3B Instruct   URL Share it on

GPT Sw3 1.3B Instruct is an open-source language model by AI-Sweden-Models. Features: 1.3b LLM, VRAM: 5.5GB, License: other, Instruction-Based, LLM Explorer Score: 0.12, Arc: 31, HellaSwag: 51.4, MMLU: 26.2, GSM8K: 1.6.

Base model:ai-sweden-models/gp... Base model:finetune:ai-sweden-...   Conversational   Da Dataset:databricks/databricks-...   Dataset:laion/oig   Dataset:openassistant/oasst1   En   Endpoints compatible   Gpt2   Instruct   Is   No   Pytorch   Region:us   Safetensors   Sv

GPT Sw3 1.3B Instruct Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

GPT Sw3 1.3B Instruct Parameters and Internals

Model Type 
language model, text generation
Use Cases 
Areas:
Nordic NLP ecosystem
Applications:
Pre-release for research and evaluation of the capabilities of Large Language Models for the Nordic languages.
Primary Use Cases:
GPT-SW3 can generate coherent text in multiple languages and perform text tasks by casting them as text generation tasks.
Limitations:
Bias, Safety, Generation diversity issues, Hallucination, Overrepresentation/Underrepresentation of certain viewpoints, Stereotypes, May generate inappropriate content
Supported Languages 
da (Unknown), sv (Unknown), no (Unknown), en (Unknown), is (Unknown)
Training Details 
Data Sources:
laion/OIG, databricks/databricks-dolly-15k, OpenAssistant/oasst1
Data Volume:
320B tokens
Methodology:
Trained with the NeMo Megatron GPT implementation.
Model Architecture:
Decoder-only transformer language model.
Input Output 
Input Format:
Raw text or instruction data in chat format.
Accepted Modalities:
text
Output Format:
Generated text
LLM NameGPT Sw3 1.3B Instruct
Repository 🤗https://huggingface.co/AI-Sweden-Models/gpt-sw3-1.3b-instruct 
Base Model(s)  AI-Sweden-Models/gpt-sw3-1.3b   AI-Sweden-Models/gpt-sw3-1.3b
Model Size1.3b
Required VRAM5.5 GB
Updated2026-06-06
MaintainerAI-Sweden-Models
Model Typegpt2
Instruction-BasedYes
Model Files  5.5 GB   5.5 GB
Supported Languagesda sv no en is
Model ArchitectureGPT2LMHeadModel
Licenseother
Model Max Length2048
Transformers Version4.22.1
Tokenizer ClassGPTSw3Tokenizer
Vocabulary Size64000
Torch Data Typefloat32
Activation Functiongelu

Rank the GPT Sw3 1.3B Instruct Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 54677 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a