Llama13b 4bit V2 by sardukar

 ยป  All LLMs  ยป  sardukar  ยป  Llama13b 4bit V2   URL Share it on

  Arxiv:2210.17323   Arxiv:2302.13971   4bit   Autotrain compatible   Endpoints compatible   Llama   Quantized   Region:us

Llama13b 4bit V2 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Llama13b 4bit V2 (sardukar/llama13b-4bit-v2)
๐ŸŒŸ Advertise your project ๐Ÿš€

Llama13b 4bit V2 Parameters and Internals

Model Type 
quantized, 4-bit
Additional Notes 
Conversion involves using the GPTQ v2 algorithm for 4-bit quantization.
Release Notes 
Version:
v2
Notes:
This model will fail to load with current GPTQ-for-LLaMa implementation
LLM NameLlama13b 4bit V2
Repository ๐Ÿค—https://huggingface.co/sardukar/llama13b-4bit-v2 
Required VRAM7 GB
Updated2025-09-23
Maintainersardukar
Model Typellama
Model Files  7.3 GB   7.0 GB
Quantization Type4bit
Model ArchitectureLLaMAForCausalLM
Transformers Version4.27.0.dev0
Tokenizer ClassLlamaTokenizer
Vocabulary Size32000
Torch Data Typefloat16

Best Alternatives to Llama13b 4bit V2

Best Alternatives
Context / RAM
Downloads
Likes
Panther V12K / 0 GB8031
BigTranslate0K / 26.5 GB86550
Toolpaca0K / 52.1 GB737
Gpt4 X Alpaca0K / 52.1 GB61
Guanaco Dumbdumb0K / 0 GB61
Note: green Score (e.g. "73.2") means that the model is better than sardukar/llama13b-4bit-v2.

Rank the Llama13b 4bit V2 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51534 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124