0.4B Mixturevitae V1 Decontaminated 300B 4096 SFT Tulu3 Decontaminated by ali-elganzory


0.4B Mixturevitae V1 Decontaminated 300B 4096 SFT Tulu3 Decontaminated is an open-source language model by ali-elganzory. Features: 0.4B parameters, VRAM: 0.8 GB, Context: 4K, LLM Explorer Score: 0.32.
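
As a rough sanity check on the VRAM figure, the weights-only footprint can be estimated from the parameter count. The sketch below assumes about 0.4 billion parameters (taken from the "0.4B" in the model name) stored at 2 bytes each (16-bit weights); the page does not state the storage precision, so that part is an assumption. The function name `weight_memory_gb` is illustrative only.

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes.

    Ignores activations, the KV cache, and framework overhead, so real
    usage during inference will be somewhat higher.
    """
    return num_params * bytes_per_param / 1e9


# Assumed parameter count: ~0.4 billion, read off the model name.
ASSUMED_PARAMS = 0.4e9

fp16_gb = weight_memory_gb(ASSUMED_PARAMS, bytes_per_param=2)  # 16-bit weights
fp32_gb = weight_memory_gb(ASSUMED_PARAMS, bytes_per_param=4)  # 32-bit weights
print(f"16-bit: {fp16_gb:.1f} GB, 32-bit: {fp32_gb:.1f} GB")
```

Under these assumptions the 16-bit estimate lines up with the 0.8 GB listed for this model.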

Tags: Base model: ali-elganzory/0.4b-..., Base model (finetune): ali-elganz..., Conversational, Custom code, Generated from trainer, Opensci, Region: us, Safetensors, SFT, TRL


0.4B Mixturevitae V1 Decontaminated 300B 4096 SFT Tulu3 Decontaminated Parameters and Internals

LLM Name: 0.4B Mixturevitae V1 Decontaminated 300B 4096 SFT Tulu3 Decontaminated
Repository: https://huggingface.co/ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-SFT-Tulu3-decontaminated
Base Model(s): ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096
Model Size: 0.4B parameters (listed as 300b, which appears to be taken from the "300B" in the model name)
Required VRAM: 0.8 GB
Updated: 2026-04-10
Maintainer: ali-elganzory
Model Type: opensci
Model Files: 0.8 GB, 0.0 GB
Model Architecture: OpensciForCausalLM
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.57.6
Tokenizer Class: GPTNeoXTokenizer
Vocabulary Size: 50304
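
The fields above suggest how the model might be loaded with the Hugging Face `transformers` library. The following is a minimal sketch, not a tested recipe: it assumes that the "Custom code" tag and the `OpensciForCausalLM` architecture mean the repository ships its own modeling code, so `trust_remote_code=True` is passed. The helper names `load_model` and `generate` are hypothetical, and the sketch has not been run against this repository.

```python
REPO_ID = (
    "ali-elganzory/"
    "0.4b-mixturevitae-v1-decontaminated-300B-4096-SFT-Tulu3-decontaminated"
)
MAX_CONTEXT = 4096  # "Context Length" from the listing


def load_model(repo_id: str = REPO_ID):
    """Load tokenizer and model; downloads the weights on first use."""
    # Imported lazily so this file can be read without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # trust_remote_code executes Python from the repository (assumed to be
    # required here); review that code before enabling this.
    tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation while staying inside the context window."""
    tokenizer, model = load_model()
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=MAX_CONTEXT - max_new_tokens,  # leave room for the output
    )
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Explain what supervised fine-tuning is.")` would download the weights and return the decoded text. Whether the repository defines a chat template for conversational use is not stated on this page.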

Best Alternatives to 0.4B Mixturevitae V1 Decontaminated 300B 4096 SFT Tulu3 Decontaminated

Best Alternatives | Context / RAM | Downloads | Likes
....4t 300B 4096 4096 Longsft 16k | 16K / 3.4 GB | 188 | 1
...B 16K SFT Tulu3 Decontaminated | 16K / 3.4 GB | 267 | 0
... 4096 SFT Tulu3 Decontaminated | 4K / 3.4 GB | 253 | 0
... 4096 SFT Tulu3 Decontaminated | 4K / 3.4 GB | 244 | 0
....7B Fineweb Edu 1.4t 300B 4096 | 4K / 3.4 GB | 44 | 0
....7B Fineweb Edu 1.4t 300B 4096 | 4K / 3.4 GB | 17 | 0



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a