Starcoder2 3B by bigcode

  Arxiv:2004.05150   Arxiv:2205.14135   Arxiv:2207.14255   Arxiv:2305.13245   Arxiv:2402.19173   Autotrain compatible   Code Dataset:bigcode/the-stack-v2-t...   Endpoints compatible   Model-index   Region:us   Safetensors   Starcoder2
Model Card on HF: https://huggingface.co/bigcode/starcoder2-3b

Starcoder2 3B Benchmarks

Starcoder2 3B Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
research, programming
Primary Use Cases:
code generation
Limitations:
Generated code may contain bugs or exploits, may be inefficient, and may produce unintended outputs
Supported Languages 
17 programming languages
Training Details 
Data Sources:
GitHub code, Arxiv, Wikipedia
Data Volume:
3+ trillion tokens
Methodology:
Grouped Query Attention, Fill-in-the-Middle objective
Context Length:
16384
Hardware Used:
160 A100 GPUs
Model Architecture:
Transformer decoder with grouped-query and sliding window attention
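
The two attention variants named above can be illustrated with a small sketch. It uses toy sizes chosen for readability, not the model's real configuration, which this listing does not give. The function names are hypothetical, and the window convention shown (each position sees itself and the preceding `window - 1` positions) is one common choice, assumed here rather than confirmed for Starcoder2.

```python
def sliding_window_causal_mask(seq_len: int, window: int) -> list[list[bool]]:
    """mask[i][j] is True when query position i may attend to key position j.

    Causal: j must not be in the future (j <= i).
    Sliding window: j must be among the last `window` positions (i - j < window).
    """
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


def kv_head_for_query_head(q_head: int, num_q_heads: int, num_kv_heads: int) -> int:
    """Grouped-query attention: consecutive query heads share one key/value head."""
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("num_q_heads must be a multiple of num_kv_heads")
    group_size = num_q_heads // num_kv_heads
    return q_head // group_size


if __name__ == "__main__":
    # Toy sizes for illustration only (assumed, not taken from the model config).
    for row in sliding_window_causal_mask(seq_len=6, window=3):
        print("".join("1" if allowed else "0" for allowed in row))
    for q in range(8):
        print(f"query head {q} -> kv head {kv_head_for_query_head(q, 8, 2)}")
```

With grouped-query attention the key/value cache shrinks by the group size, and with a sliding window the attention cost per token stays bounded by the window rather than growing with the full context.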
Input Output 
Accepted Modalities:
text
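
Because the model takes plain text and was trained with a Fill-in-the-Middle objective, an infilling request is usually expressed as a single prompt string. The sketch below assumes the sentinel tokens used by StarCoder-family tokenizers and a prefix-suffix-middle ordering. Neither is stated in this listing, so check both against the tokenizer's special tokens before relying on them. `build_fim_prompt` is a hypothetical helper name.

```python
# Sentinel strings assumed from StarCoder-family tokenizers; verify before use.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Arrange the code before and after the gap so the model generates the gap."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


if __name__ == "__main__":
    before = "def add(a, b):\n    "
    after = "\n\nprint(add(1, 2))\n"
    print(build_fim_prompt(before, after))
```

The model's completion after the final sentinel is the text intended to sit between `before` and `after`.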
LLM Name: Starcoder2 3B
Repository: https://huggingface.co/bigcode/starcoder2-3b
Model Size: 3b
Required VRAM: 12.1 GB
Updated: 2025-06-09
Maintainer: bigcode
Model Type: starcoder2
Model Files: 12.1 GB
Model Architecture: Starcoder2ForCausalLM
License: bigcode-openrail-m
Context Length: 16384
Model Max Length: 16384
Transformers Version: 4.37.0.dev0
Tokenizer Class: GPT2Tokenizer
Vocabulary Size: 49152
Starcoder2 3B (bigcode/starcoder2-3b)
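
A minimal sketch of loading the repository named above and generating a completion, assuming PyTorch and a `transformers` release with Starcoder2 support are installed. `AutoTokenizer` and `AutoModelForCausalLM` are standard `transformers` classes. `max_new_tokens_for` and `complete` are hypothetical helpers written for this example. The clamp keeps prompt plus completion inside the 16384-token context length listed above.

```python
MODEL_ID = "bigcode/starcoder2-3b"  # repository named in the listing
CONTEXT_LENGTH = 16384              # context length from the listing


def max_new_tokens_for(prompt_tokens: int, requested: int,
                       context_length: int = CONTEXT_LENGTH) -> int:
    """Clamp the generation budget so prompt + completion fits the context."""
    remaining = max(context_length - prompt_tokens, 0)
    return min(requested, remaining)


def complete(prompt: str, requested_new_tokens: int = 64) -> str:
    """Generate a continuation of `prompt` (downloads the weights on first use)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="pt")
    budget = max_new_tokens_for(inputs["input_ids"].shape[1], requested_new_tokens)
    if budget == 0:
        raise ValueError("prompt already fills the context window")

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (not run here because it downloads several GB of weights):
# print(complete("def fibonacci(n):"))
```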

Quantized Models of the Starcoder2 3B

Model                 Likes / Downloads / VRAM
Starcoder2 3B AWQ     05372 GB
StarCoder2 3B GGUF    916661 GB

Best Alternatives to Starcoder2 3B

Best Alternatives               Context / RAM     Downloads / Likes
Starcoder2 3b AutoRedteam       16K / 12.7 GB     50
Starcoder Proto Code            16K / 6.1 GB      280
Mojo Starcoder2                 16K / 6.4 GB      110
NEAR StructTunedStarcoder2      16K / 6.1 GB      150
NEARCoder 3B                    16K / 6.1 GB      80
NEAR PreTrainedStarCoder2       16K / 6.1 GB      120
Bigcode Starcoder2 3B 8bits     16K / 3.2 GB      120
Starcoder2 3B Instruct          16K / 6.1 GB      5524
OpenCodeInterpreter SC2 3B      16K / 6.4 GB      167
Opencsg Starcoder2 3B V0.1      16K / 6.4 GB      151
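
The memory figures on this page are roughly what parameter count times bytes per parameter would predict. The sketch below assumes about 3.03 billion parameters, a figure not stated in this listing, and counts weights only, ignoring activations, the key/value cache, and quantization metadata. Under that assumption, 32-bit weights land near the 12.1 GB quoted for Starcoder2 3B and 16-bit weights near the 6.1 GB shown for several alternatives. The listing does not say which precision each entry uses, so this is a plausibility check rather than a confirmed explanation.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4.0, "float16": 2.0, "int8": 1.0, "int4": 0.5}

# Assumed parameter count for a "3b" model; not taken from the listing.
ASSUMED_PARAMS = 3.03e9


def weight_size_gb(num_params: float, dtype: str) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype:>8}: {weight_size_gb(ASSUMED_PARAMS, dtype):.2f} GB")
```

Entries slightly above these estimates, such as the 3.2 GB 8-bit variant, are consistent with extra tensors or layers kept at higher precision, though the listing does not break this down.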

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124