Starcoder2 7B by bigcode


Tags: Arxiv:2004.05150, Arxiv:2205.14135, Arxiv:2207.14255, Arxiv:2305.13245, Arxiv:2402.19173, Autotrain compatible, Code, Dataset:bigcode/the-stack-v2-t..., Endpoints compatible, Model-index, Region:us, Safetensors, Sharded, Starcoder2, Tensorflow
Model Card on HF 🤗: https://huggingface.co/bigcode/starcoder2-7b


Starcoder2 7B Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
research, commercial applications
Applications:
software development, educational tools
Primary Use Cases:
code generation, automated coding assistance
Limitations:
Generated code is not guaranteed to work as intended and may contain inefficiencies or bugs
Additional Notes 
The model is not instruction-tuned, so explicit commands such as "Write a function that computes the square root" may not work well.
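Because a base model continues text instead of following commands, one common workaround is to phrase the request as the beginning of a source file. The sketch below only builds such a prompt string; `as_completion_prompt` is a hypothetical helper introduced here for illustration and is not part of the model card or any library.

```python
def as_completion_prompt(function_name: str, args: str, description: str) -> str:
    """Turn a plain-language request into code for the model to continue.

    Hypothetical helper for illustration: it writes a function signature and
    a docstring, leaving the function body for the base model to complete.
    """
    return (
        f"def {function_name}({args}):\n"
        f'    """{description}"""\n'
    )


prompt = as_completion_prompt(
    "square_root", "x: float", "Return the square root of x."
)
print(prompt)
```

The resulting string would then be passed to whatever text-generation call is in use, in place of the instruction-style sentence.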
Supported Languages 
EN (High), multi-language (Limited)
Training Details 
Data Sources:
GitHub code, Arxiv, Wikipedia
Data Volume:
3.5+ trillion tokens
Methodology:
Grouped Query Attention, a context window of 16,384 tokens, sliding window attention of 4,096 tokens, Fill-in-the-Middle objective
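A Fill-in-the-Middle prompt places the code before and after a gap around sentinel tokens, so that the model generates the missing middle. The following is a minimal sketch: the sentinel strings are an assumption based on the StarCoder-family convention and should be checked against the tokenizer's special tokens, and `build_fim_prompt` is a hypothetical helper name.

```python
# Assumption: these sentinel strings follow the StarCoder-family convention.
# Verify them against the tokenizer's special tokens before relying on them.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Arrange the code before and after a gap so the model fills the gap."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


prompt = build_fim_prompt(
    prefix="def mean(values):\n    total = ",
    suffix="\n    return total / len(values)\n",
)
print(prompt)
```

Whatever the model generates after the final sentinel is the candidate text for the gap between prefix and suffix.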
Hardware Used:
432 H100 GPUs
Model Architecture:
Transformer decoder with grouped-query and sliding window attention and Fill-in-the-Middle objective
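Sliding window attention restricts each position to a fixed number of recent positions instead of the whole preceding context. The sketch below builds the corresponding boolean mask in plain Python with toy sizes. Whether the window counts the current token is an implementation detail that varies between libraries; this sketch assumes it does.

```python
def sliding_window_causal_mask(seq_len: int, window: int) -> list[list[bool]]:
    """mask[i][j] is True when query position i may attend to key position j.

    Causal: j <= i. Sliding window: only the most recent `window` positions
    (counting the current one), i.e. j > i - window.
    """
    return [
        [(j <= i) and (j > i - window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Toy sizes for readability; the listing above gives a 4,096-token window
# inside a 16,384-token context.
mask = sliding_window_causal_mask(seq_len=6, window=3)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

Each printed row has at most `window` marked positions, which is why the per-token attention cost stops growing once the sequence is longer than the window.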
Input Output 
Accepted Modalities:
text
LLM Name: Starcoder2 7B
Repository 🤗: https://huggingface.co/bigcode/starcoder2-7b
Model Size: 7b
Required VRAM: 14.4 GB
Updated: 2025-06-09
Maintainer: bigcode
Model Type: starcoder2
Model Files: 4.9 GB (1-of-3), 5.0 GB (2-of-3), 4.5 GB (3-of-3)
Model Architecture: Starcoder2ForCausalLM
License: bigcode-openrail-m
Context Length: 16384
Model Max Length: 16384
Transformers Version: 4.37.0.dev0
Tokenizer Class: GPT2Tokenizer
Vocabulary Size: 49152
Torch Data Type: bfloat16
Activation Function: gelu
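The required VRAM figure equals the sum of the three model files, which is also what two bytes per bfloat16 parameter would predict. The sketch below reproduces that arithmetic. It covers weights only: activations and the attention cache need additional memory, and the 4-bit estimate ignores quantization overhead such as scales, so real quantized files are somewhat larger. `weights_gb` is a hypothetical helper name.

```python
def weights_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    return num_params * bits_per_param / 8 / 1e9


# Shard sizes as listed above, in GB.
shards_gb = [4.9, 5.0, 4.5]
total_gb = sum(shards_gb)

# bfloat16 stores each parameter in 2 bytes, so the checkpoint size implies
# roughly this many parameters.
implied_params = total_gb * 1e9 / 2

print(f"total checkpoint size: {total_gb:.1f} GB")
print(f"implied parameters: {implied_params / 1e9:.1f}B")
print(f"4-bit weights only: {weights_gb(implied_params, 4):.1f} GB")
```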
Starcoder2 7B (bigcode/starcoder2-7b)

Quantized Models of the Starcoder2 7B

Model
Likes
Downloads
VRAM
Starcoder2 7B AWQ | 1 | 21 | 4 GB
StarCoder2 7B GGUF1357142 GB

Best Alternatives to Starcoder2 7B

Best Alternatives
Context / RAM
Downloads
Likes
Dolphincoder Starcoder2 7B | 16K / 14.9 GB | 2011
Starcoder2 7B Int4 Ov | 16K / 3.8 GB | 150
Jmg Starcoder2 7B 100K | 16K / 14.4 GB | 140
Starcoder2 7B Instruct | 16K / 14.4 GB | 5732
Speechless Starcoder2 7B | 16K / 14.4 GB | 105
OpenCodeInterpreter SC2 7B | 16K / 14.9 GB | 3414
Starcoder2 Chat | 16K / 28.8 GB | 352
...Starcoder2 7B Bnb 4bit Smashed | 16K / 4.4 GB | 120
Starcoder2 7B 4bit | 16K / 4.4 GB | 582
Starcoder2 7B AWQ | 16K / 4.5 GB | 211

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124