DR Venus 4B SFT by inclusionAI

 »  All LLMs  »  inclusionAI  »  DR Venus 4B SFT   URL Share it on

DR Venus 4B SFT is an open-source language model by inclusionAI. Features: 4b LLM, VRAM: 8.8GB, Context: 256K.

  Arxiv:2604.19859   Qwen3   Region:us   Safetensors   Sharded   Tensorflow

DR Venus 4B SFT Parameters and Internals

LLM NameDR Venus 4B SFT
Repository 🤗https://huggingface.co/inclusionAI/DR-Venus-4B-SFT 
Model Size4b
Required VRAM8.8 GB
Updated2026-05-06
MaintainerinclusionAI
Model Typeqwen3
Model Files  5.0 GB: 1-of-2   3.8 GB: 2-of-2
Model ArchitectureQwen3ForCausalLM
Context Length262144
Model Max Length262144
Transformers Version4.52.3
Tokenizer ClassQwen2Tokenizer
Padding Token<|endoftext|>
Vocabulary Size151936
Torch Data Typebfloat16
Errorsreplace

Best Alternatives to DR Venus 4B SFT

Best Alternatives
Context / RAM
Downloads
Likes
Qwen3 4B Instruct 2507256K / 8.1 GB10823759832
GRPO 4 70256K / 8.1 GB50
Qwen3 4B Thinking 2507256K / 8.1 GB830285581
OpenSonnet Lite256K / 8.1 GB2816
Lightning 4B256K / 8.1 GB136
QED Nano256K / 8.1 GB681686
Jan V1 4B256K / 8.1 GB108354353
Qwen3 4B Instruct 2507 FP8256K / 5.2 GB71535873
AgentCPM Explore256K / 8.9 GB204413
Qwen3 4B Thinking 2507 FP8256K / 5.2 GB19253066
Note: green Score (e.g. "73.2") means that the model is better than inclusionAI/DR-Venus-4B-SFT.

Rank the DR Venus 4B SFT Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53493 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a