Qwen3 2507 4B Instruct Haiku 4.5 Merged FP16 is an open-source language model by Tralalabs. Features: 4b LLM, VRAM: 8.1GB, Context: 256K, License: apache-2.0, Quantized, Instruction-Based, Merged.
| LLM Name | Qwen3 2507 4B Instruct Haiku 4.5 Merged FP16 |
| Repository 🤗 | https://huggingface.co/Tralalabs/Qwen3-2507-4B-Instruct-Haiku-4.5-Merged-FP16 |
| Base Model(s) | |
| Merged Model | Yes |
| Model Size | 4b |
| Required VRAM | 8.1 GB |
| Updated | 2026-05-09 |
| Maintainer | Tralalabs |
| Model Type | qwen3 |
| Instruction-Based | Yes |
| Model Files | |
| Supported Languages | en |
| Quantization Type | fp16 |
| Model Architecture | Qwen3ForCausalLM |
| License | apache-2.0 |
| Context Length | 262144 |
| Model Max Length | 262144 |
| Transformers Version | 5.8.0 |
| Tokenizer Class | Qwen2Tokenizer |
| Padding Token | <|im_end|> |
| Vocabulary Size | 151936 |
| Errors | replace |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| Agent 4b | 256K / 8.1 GB | 362 | 0 |
| ...Instruct 2507 Unsloth Bnb 4bit | 256K / 3.5 GB | 95627 | 14 |
| Qwen3 4B Instruct 2507 4bit | 256K / 2.3 GB | 17910 | 9 |
| Jan V3 4B Base Instruct 4bit | 256K / 2.3 GB | 541 | 2 |
| ...wen3 4B Instruct 2507 Bnb 4bit | 256K / 2.6 GB | 7564 | 5 |
| Fact Extractor Dev 1b | 256K / 8.1 GB | 41 | 0 |
| Jan V3 4B Base Instruct 8bit | 256K / 4.3 GB | 73 | 3 |
| ...4B Instruct 2507 4bit DWQ 2510 | 256K / 2.3 GB | 196 | 2 |
| Qwen3 4B Instruct 2507 8bit | 256K / 4.3 GB | 98 | 4 |
| ...wen3 4B Instruct 2507 Sft Full | 256K / 8.1 GB | 5 | 0 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟