Qwen3 14B Non Thinking V6 16bit by thomasavare


Qwen3 14B Non Thinking V6 16bit is an open-source language model by thomasavare. Features: 14B parameters, VRAM: 29.5 GB, Context: 40K tokens, License: apache-2.0, Quantized, LLM Explorer Score: 0.21.

Tags: 4bit, Autotrain compatible, Base model:finetune:unsloth/qw..., Base model:unsloth/qwen3-14b-u..., Conversational, En, Endpoints compatible, Quantized, Qwen3, Region:us, Safetensors, Sharded, Tensorflow, Unsloth


Qwen3 14B Non Thinking V6 16bit Parameters and Internals

LLM Name: Qwen3 14B Non Thinking V6 16bit
Repository: https://huggingface.co/thomasavare/Qwen3-14B-non-thinking-v6-16bit
Base Model(s): Qwen3 14B Unsloth Bnb 4bit (unsloth/Qwen3-14B-unsloth-bnb-4bit)
Model Size: 14b
Required VRAM: 29.5 GB
Updated: 2025-09-09
Maintainer: thomasavare
Model Type: qwen3
Model Files: 5.0 GB (1-of-6), 5.0 GB (2-of-6), 4.9 GB (3-of-6), 5.0 GB (4-of-6), 4.9 GB (5-of-6), 4.7 GB (6-of-6)
Supported Languages: en
Quantization Type: 4bit
Model Architecture: Qwen3ForCausalLM
License: apache-2.0
Context Length: 40960
Model Max Length: 40960
Transformers Version: 4.51.3
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|vision_pad|>
Vocabulary Size: 151936
Torch Data Type: bfloat16
Errors: replace
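
As a quick sanity check on the figures listed above, the six shard sizes under Model Files can be summed and compared with the Required VRAM value. The sketch below also estimates the parameter count implied by a bfloat16 checkpoint. It assumes 2 bytes per weight and 1 GB = 1e9 bytes; the function names are illustrative and not part of any library.

```python
# Shard sizes in GB, copied from the "Model Files" entry above.
SHARD_SIZES_GB = [5.0, 5.0, 4.9, 5.0, 4.9, 4.7]

# Value copied from the "Required VRAM" entry above.
REQUIRED_VRAM_GB = 29.5

# Assumption: bfloat16 stores each weight in 2 bytes.
BYTES_PER_BF16_PARAM = 2


def total_shard_size_gb(sizes):
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(sizes), 1)


def implied_param_count_billions(total_gb, bytes_per_param=BYTES_PER_BF16_PARAM):
    """Rough parameter count (in billions) for a checkpoint of total_gb.

    Assumes 1 GB = 1e9 bytes and ignores non-weight data in the files.
    """
    return total_gb / bytes_per_param


if __name__ == "__main__":
    total = total_shard_size_gb(SHARD_SIZES_GB)
    print(f"Total shard size: {total} GB")
    print(f"Matches listed VRAM: {total == REQUIRED_VRAM_GB}")
    print(f"Implied parameters: ~{implied_param_count_billions(total):.2f}B")
```

Under these assumptions the shard total equals the listed VRAM figure, and the implied parameter count is in line with the 14b model size, which suggests the VRAM figure reflects the 16-bit weights alone rather than runtime overhead such as the KV cache.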

Best Alternatives to Qwen3 14B Non Thinking V6 16bit

Best Alternatives               Context / RAM     Downloads and Likes (run together in source)
Qwen3 14B Unsloth Bnb 4bit      40K / 11.2 GB     19717414
Qwen3 14B MLX 4bit              40K / 8.3 GB      397085
Hermes 4 14B 4bit               40K / 8.3 GB      3213
Qwen3 14B MLX 8bit              40K / 15.2 GB     11635
Qwen3 14B MLX 4bit              40K / 7.9 GB      10279
Bee1reason Arabic Qwen 14B      40K / 29.5 GB     5848
Merged16 Sft Qwen3              40K / 29.5 GB     50
Merged16 Sft Qwen3 32 2         40K / 29.5 GB     110
Qwen3 14B 4bit DWQ 053125       40K / 8.3 GB      2756
Merged16 Kto Qwen3 14B          40K / 29.5 GB     50


Original data from HuggingFace, OpenCompass and various public git repos.