What are the hardware requirements for Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed?

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed requires approximately 1.4 GB of VRAM and supports a context window of 128K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Where can I download or evaluate Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed?

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed by PrunaAI — VRAM 1.4GB, 128K context

Name: Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed
Author: PrunaAI

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed is an open-source language model by PrunaAI. Features: LLM, VRAM: 1.4GB, Context: 128K, Quantized, Instruction-Based, LLM Explorer Score: 0.12.

2bit Base model:finetune:microsoft/... Base model:microsoft/phi-3-min... Custom code Instruct Phi3 Pruna-ai Quantized Region:us

Model Card on HF 🤗: https://huggingface.co/PrunaAI/microsoft-Phi-3-mini-128k-instruct-HQQ-2bit-smashed

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed Benchmarks

LLME Score: 0.12419

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed (PrunaAI/microsoft-Phi-3-mini-128k-instruct-HQQ-2bit-smashed)

🌟 Advertise your project 🚀

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed Parameters and Internals

Model Type

compressed, optimized

Use Cases

Limitations:

The quality of the model output might vary compared to the base model.

Considerations:

Efficiency results may vary in other settings. We recommend running them in the use-case conditions.

Additional Notes

To compress your own models, contact PrunaAI for premium access and tech support for specific use-cases.

Training Details

Methodology:

The model is compressed with hqq.

Input Output

Input Format:

Tokens from text queries (e.g. 'What is the color of prunes?')

Accepted Modalities:

text

Output Format:

Text generation (decoded response)

Performance Tips:

Directly assess efficiency gains in your use-cases.

LLM Name	Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed
Repository 🤗	https://huggingface.co/PrunaAI/microsoft-Phi-3-mini-128k-instruct-HQQ-2bit-smashed
Base Model(s)	Phi 3 Mini 128K Instruct microsoft/Phi-3-mini-128k-instruct
Required VRAM	1.4 GB
Updated	2025-09-23
Maintainer	PrunaAI
Model Type	phi3
Instruction-Based	Yes
Model Files	1.4 GB
Quantization Type	2bit
Model Architecture	Phi3ForCausalLM
Context Length	131072
Model Max Length	131072
Transformers Version	4.42.4
Tokenizer Class	LlamaTokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	32064
Torch Data Type	bfloat16

Best Alternatives to Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed

Best Alternatives	Context / RAM	Downloads	Likes
Phi 3 Mini 128K Instruct 8bit	128K / 4.1 GB	73	10
Phi 4 Mini Instruct 8bit	128K / 4.1 GB	337	7
Phi 4 Mini Instruct 6bit	128K / 3.1 GB	77	2
Phi 3.5 Mini Instruct 4bit	128K / 2.1 GB	2681	9
Phi 3.5 Mini Instruct 8bit	128K / 4.1 GB	63	6
...m 128K Instruct 8.0bpw H8 EXL2	128K / 13.4 GB	141	4
...hi 3 Medium 128K Instruct 4bit	128K / 7.8 GB	25	2
...hi 3 Medium 128K Instruct 8bit	128K / 14.9 GB	20	2
...m 128K Instruct 6.0bpw H6 EXL2	128K / 10.7 GB	7	3
...dium 128K Instruct 8 0bpw EXL2	128K / 13.4 GB	2	1

Note: green Score (e.g. "73.2") means that the model is better than PrunaAI/microsoft-Phi-3-mini-128k-instruct-HQQ-2bit-smashed.

Rank the Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 54931 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed by PrunaAI

» All LLMs » PrunaAI » Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed URL Share it on

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed Benchmarks

Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed Parameters and Internals

Best Alternatives to Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed

Rank the Microsoft Phi 3 Mini 128K Instruct HQQ 2bit Smashed Capabilities

What open-source LLMs or SLMs are you in search of? 54931 in total.