What are the hardware requirements for Phi 4 Mini Instruct ONNX GQA?

Phi 4 Mini Instruct ONNX GQA requires a context window of 128K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Phi 4 Mini Instruct ONNX GQA and how large is it?

Phi 4 Mini Instruct ONNX GQA is developed by onnx-community. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Phi 4 Mini Instruct ONNX GQA?

Phi 4 Mini Instruct ONNX GQA is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Phi 4 Mini Instruct ONNX GQA by onnx-community — 128K context

Name: Phi 4 Mini Instruct ONNX GQA
Author: onnx-community

Phi 4 Mini Instruct ONNX GQA is an open-source language model by onnx-community. Features: LLM, Context: 128K, Instruction-Based, LLM Explorer Score: 0.18.

Base model:microsoft/phi-4-min... Base model:quantized:microsoft... Conversational Instruct Onnx Phi3 Region:us Transformers.js

Model Card on HF 🤗: https://huggingface.co/onnx-community/Phi-4-mini-instruct-ONNX-GQA

Phi 4 Mini Instruct ONNX GQA Benchmarks

LLME Score: 0.18172

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Phi 4 Mini Instruct ONNX GQA (onnx-community/Phi-4-mini-instruct-ONNX-GQA)

🌟 Advertise your project 🚀

Phi 4 Mini Instruct ONNX GQA Parameters and Internals

LLM Name	Phi 4 Mini Instruct ONNX GQA
Repository 🤗	https://huggingface.co/onnx-community/Phi-4-mini-instruct-ONNX-GQA
Base Model(s)	Phi 4 Mini Instruct microsoft/Phi-4-mini-instruct
Updated	2026-03-03
Maintainer	onnx-community
Model Type	phi3
Instruction-Based	Yes
Model Architecture	Phi3ForCausalLM
Context Length	131072
Model Max Length	131072
Transformers Version	4.50.0.dev0
Tokenizer Class	GPT2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	200064
Torch Data Type	bfloat16

Best Alternatives to Phi 4 Mini Instruct ONNX GQA

Best Alternatives	Context / RAM	Downloads	Likes
Phi 4 Mini Instruct	128K / 7.7 GB	637798	706
Phi 3 Mini 128K Instruct	128K / 7.7 GB	252372	1699
Phi 3 Medium 128K Instruct	128K / 28 GB	5878	387
Phi 3.5 Mini Instruct Onnx Web	128K / GB	612	15
Phi 3.5 Mini Instruct Onnx	128K / GB	108	37
Phi 3 Mini 128K Instruct Onnx	128K / GB	97	192
...i 3 Mini 128K Instruct Ov Int4	128K / 2 GB	5	0
...Medium 128K Instruct Onnx Cuda	128K / GB	27	23
... Medium 128K Instruct Onnx Cpu	128K / GB	39	13
...um 128K Instruct Onnx Directml	128K / GB	32	6

Note: green Score (e.g. "73.2") means that the model is better than onnx-community/Phi-4-mini-instruct-ONNX-GQA.

Rank the Phi 4 Mini Instruct ONNX GQA Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 53232 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Phi 4 Mini Instruct ONNX GQA by onnx-community

» All LLMs » onnx-community » Phi 4 Mini Instruct ONNX GQA URL Share it on

Phi 4 Mini Instruct ONNX GQA Benchmarks

Phi 4 Mini Instruct ONNX GQA Parameters and Internals

Best Alternatives to Phi 4 Mini Instruct ONNX GQA

Rank the Phi 4 Mini Instruct ONNX GQA Capabilities

What open-source LLMs or SLMs are you in search of? 53232 in total.