Granite 4.1 3B Fp8 by ibm-granite

Granite 4.1 3B Fp8 is an open-source language model by ibm-granite. Features: 3B parameters, VRAM: 4.2 GB, context: 128K, license: apache-2.0, quantized, LLM Explorer Score: 0.32.

Tags: Base model:ibm-granite/granite..., Base model:quantized:ibm-grani..., Compressed-tensors, Conversational, Deploy:azure, Endpoints compatible, Gguf, Granite, Granite-4.1, Language, Quantized, Region:us, Safetensors

Granite 4.1 3B Fp8 Parameters and Internals

LLM Name: Granite 4.1 3B Fp8
Repository: https://huggingface.co/ibm-granite/granite-4.1-3b-fp8
Base Model(s): ibm-granite/granite-4.1-3b
Model Size: 3b
Required VRAM: 4.2 GB
Updated: 2026-05-01
Maintainer: ibm-granite
Model Type: granite
Model Files: 4.2 GB
GGUF Quantization: Yes
Quantization Type: gguf
Model Architecture: GraniteForCausalLM
License: apache-2.0
Context Length: 131072
Model Max Length: 131072
Transformers Version: 4.57.6
Tokenizer Class: GPT2Tokenizer
Padding Token: <|pad|>
Vocabulary Size: 100352
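
The figures in this listing can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original listing: the function names are hypothetical, the numbers are copied from this page, and the 2-bytes-per-weight figure for the unquantized model is an assumption rather than something the listing states.

```python
# Values copied from the listing; nothing here is measured.
CONTEXT_LENGTH_TOKENS = 131072  # "Context Length" / "Model Max Length"
FP8_FILES_GB = 4.2              # "Model Files" for Granite 4.1 3B Fp8
BASE_FILES_GB = 6.8             # RAM listed for the unquantized Granite 4.1 3B


def context_in_k(tokens: int) -> int:
    """Express a token count in units of 1024 tokens ("K")."""
    return tokens // 1024


def size_ratio(quantized_gb: float, base_gb: float) -> float:
    """Fraction of the base model's size that the quantized files occupy."""
    return quantized_gb / base_gb


def implied_params_billions(size_gb: float, bytes_per_param: float) -> float:
    """Parameter count (in billions) implied by a file size.

    ASSUMES every weight is stored at the same width and 1 GB = 1e9 bytes.
    """
    return size_gb / bytes_per_param


if __name__ == "__main__":
    print(f"context: {context_in_k(CONTEXT_LENGTH_TOKENS)}K tokens")
    ratio = size_ratio(FP8_FILES_GB, BASE_FILES_GB)
    print(f"FP8 files are {ratio:.0%} of the base size")
    # Assumption: the base checkpoint stores 2-byte (16-bit) weights.
    print(f"implied parameters: {implied_params_billions(BASE_FILES_GB, 2):.1f}B")
```

The first check confirms that 131072 tokens is the "128K" context quoted in the summary. The size ratio comes out above one half, which would be consistent with some tensors being kept at higher precision than 8 bits, though the listing does not say which.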

Best Alternatives to Granite 4.1 3B Fp8

Best Alternatives | Context / RAM | Downloads / Likes
Granite 4.1 3B | 128K / 6.8 GB | 145342
Granite 4.1 3B Base | 128K / 6.8 GB | 2113
PowerLM 3B | 4K / 14 GB | 1158218
PowerLM 3B | 4K / 14 GB | 9534020
Granite 3B Mup Test Artifact | 4K / 14 GB | 1440

Original data from HuggingFace, OpenCompass and various public git repos.