Qwen3 18B QiMing V1.0 Silence Of The Qwen by DavidAU


Tags: Arxiv:2309.00071, Arxiv:2401.02415, All use cases, Autotrain compatible, Bagua, Base model:aifeifei798/qiming-..., Base model:finetune:aifeifei79..., Brainstorm, Brainstorm 20x, Chat, Code, Code generation, Codegen, Codeqwen, Coder, Coding, Cognitive-architecture, Conversational, Creative, De, Decision-making, En, Endpoints compatible, Finetuned, Fr, Moe, Not-for-all-audiences, Optional thinking, Philosophy-driven-ai, Qiming, Qiming-holos, Qwen, Qwen-coder, Qwen2, Qwen3, Region:us, Safetensors, Sharded, Strategic-analysis, Tensorflow, Zh

Qwen3 18B QiMing V1.0 Silence Of The Qwen Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Qwen3 18B QiMing V1.0 Silence Of The Qwen (DavidAU/Qwen3-18B-QiMing-V1.0-Silence-Of-The-Qwen)

Qwen3 18B QiMing V1.0 Silence Of The Qwen Parameters and Internals

LLM Name: Qwen3 18B QiMing V1.0 Silence Of The Qwen
Repository: https://huggingface.co/DavidAU/Qwen3-18B-QiMing-V1.0-Silence-Of-The-Qwen
Base Model(s): aifeifei798/QiMing-v1.0-14B
Model Size: 14b
Required VRAM: 35.5 GB
Updated: 2025-09-04
Maintainer: DavidAU
Model Type: qwen3
Model Files: 5.0 GB (1-of-8), 4.9 GB (2-of-8), 5.0 GB (3-of-8), 4.9 GB (4-of-8), 5.0 GB (5-of-8), 4.9 GB (6-of-8), 5.0 GB (7-of-8), 0.8 GB (8-of-8)
Supported Languages: en, fr, zh, de
Model Architecture: Qwen3ForCausalLM
License: apache-2.0
Context Length: 40960
Model Max Length: 40960
Transformers Version: 4.55.0
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 151936
Torch Data Type: bfloat16
Errors: replace
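
The figures above can be cross-checked and put to use with a short Python sketch. It sums the listed shard sizes, estimates the parameter count, and shows one plausible way to load the checkpoint with the Hugging Face `transformers` library. The helper names (`total_shard_gb`, `approx_params_billions`, `load_model`) are hypothetical, the parameter estimate assumes 1 GB = 10^9 bytes with 2 bytes per bfloat16 weight, and `device_map="auto"` assumes the `accelerate` package is installed. None of this comes from the model author.

```python
"""Sketch: sanity-check the listed shard sizes and load the checkpoint."""

# Values copied from the listing.
MODEL_ID = "DavidAU/Qwen3-18B-QiMing-V1.0-Silence-Of-The-Qwen"
MAX_CONTEXT = 40960  # "Context Length"

# Shard sizes in GB from the "Model Files" row (1-of-8 through 8-of-8).
SHARD_SIZES_GB = [5.0, 4.9, 5.0, 4.9, 5.0, 4.9, 5.0, 0.8]


def total_shard_gb():
    """Total size of all weight shards in GB."""
    return sum(SHARD_SIZES_GB)


def approx_params_billions(bytes_per_param=2):
    """Rough parameter count in billions.

    Assumption: 1 GB = 1e9 bytes and the files hold only weights,
    stored at 2 bytes each (bfloat16).
    """
    return total_shard_gb() / bytes_per_param


def load_model():
    """Load tokenizer and model in bfloat16 (downloads all shards).

    Imports are deferred so the arithmetic helpers above work
    without torch or transformers installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # matches "Torch Data Type"
        device_map="auto",           # assumption: accelerate is installed
    )
    return tokenizer, model
```

Under these assumptions the shard total rounds to the listed 35.5 GB of required VRAM, and the estimate comes to about 17.75 billion parameters. Calling `load_model()` fetches every shard, so it is only practical on a machine with enough disk space and memory for the full checkpoint.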

Best Alternatives to Qwen3 18B QiMing V1.0 Silence Of The Qwen

Model | Context / RAM | Downloads and Likes
SimpleChat 14B V1 | 195K / 29.5 GB | 332
...0528DistillQwen 14B V27.3 200K | 195K / 29.5 GB | 174
...uct 21B Brainstorm20x 128K Ctx | 128K / 84.1 GB | 210
Hermes 4 14B | 40K / 29.5 GB | 135133
Qwen3 14B | 40K / 29.7 GB | 1031362257
Qwen3 14B FP8 | 40K / 16.4 GB | 8247832
Qwen3 14B | 40K / 29.5 GB | 2287810
Qwen3 14B Abliterated | 40K / 59 GB | 562339
Hermes 4 14B FP8 | 40K / 16.4 GB | 392
QiMing V1.0 14B | 40K / 29.2 GB | 520
Note: green Score (e.g. "73.2") means that the model is better than DavidAU/Qwen3-18B-QiMing-V1.0-Silence-Of-The-Qwen.

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124