Mamba 2.8B Ultrachat Hf by han1997

 ยป  All LLMs  ยป  han1997  ยป  Mamba 2.8B Ultrachat Hf   URL Share it on

  Autotrain compatible   Endpoints compatible   Mamba   Region:us   Safetensors   Sharded   Tensorflow

Mamba 2.8B Ultrachat Hf Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
๐ŸŒŸ Advertise your project ๐Ÿš€

Mamba 2.8B Ultrachat Hf Parameters and Internals

Model Type 
Causal LM
Additional Notes 
This model is compatible with the 'transformers' library and has been prepared for use with it. The package requires installation from the main branch until version 4.39.0 is released. Additional packages 'causal-conv1d' and 'mamba-ssm' provide optimized CUDA kernel support.
LLM NameMamba 2.8B Ultrachat Hf
Repository ๐Ÿค—https://huggingface.co/han1997/mamba-2.8b-ultrachat-hf 
Model Size2.8b
Required VRAM11.1 GB
Updated2025-06-09
Maintainerhan1997
Model Typemamba
Model Files  5.0 GB: 1-of-3   5.0 GB: 2-of-3   1.1 GB: 3-of-3
Model ArchitectureMambaForCausalLM
Licenseapache-2.0
Transformers Version4.40.0.dev0
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50280
Torch Data Typefloat32
Mamba 2.8B Ultrachat Hf (han1997/mamba-2.8b-ultrachat-hf)

Best Alternatives to Mamba 2.8B Ultrachat Hf

Best Alternatives
Context / RAM
Downloads
Likes
Mamba 2.8B Hf0K / 11.1 GB9366107
Clinicalmamba 2.8B Hf0K / 11.1 GB353
Mamba 2.8B Slimpj Hf0K / 11.1 GB380
Mamba 2.8B Zephyr Hf0K / 11.1 GB200
Mamba 2.8B0K / 5.6 GB191
Mamba 2.8B Slimpj0K / 5.6 GB180
Mamba Ko 2.8B0K / 5.8 GB6418
Mamba 2.8B Hf GGUF0K / 1.4 GB350
Note: green Score (e.g. "73.2") means that the model is better than han1997/mamba-2.8b-ultrachat-hf.

Rank the Mamba 2.8B Ultrachat Hf Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 48046 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124