LLaMA 65B 4bit 32g by Neko-Institute-of-Science

 ยป  All LLMs  ยป  Neko-Institute-of-Science  ยป  LLaMA 65B 4bit 32g   URL Share it on

  4bit   Autotrain compatible   Endpoints compatible   Llama   Quantized   Region:us

LLaMA 65B 4bit 32g Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
LLaMA 65B 4bit 32g (Neko-Institute-of-Science/LLaMA-65B-4bit-32g)
๐ŸŒŸ Advertise your project ๐Ÿš€

LLaMA 65B 4bit 32g Parameters and Internals

Model Type 
text generation
Additional Notes 
The user attempted to run the model with group sizes of 16 and 32g for specific benchmarks and noted the ability to run the full context on an A6000.
LLM NameLLaMA 65B 4bit 32g
Repository ๐Ÿค—https://huggingface.co/Neko-Institute-of-Science/LLaMA-65B-4bit-32g 
Model Size65b
Required VRAM38.5 GB
Updated2025-09-13
MaintainerNeko-Institute-of-Science
Model Typellama
Model Files  38.5 GB
Quantization Type4bit
Model ArchitectureLlamaForCausalLM
Context Length2048
Model Max Length2048
Transformers Version4.28.1
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typefloat16

Best Alternatives to LLaMA 65B 4bit 32g

Best Alternatives
Context / RAM
Downloads
Likes
Robin 65B V2 Fp162K / 130.4 GB18163
...Unlocked Alpaca 65B QLoRA Fp162K / 130.4 GB181510
Alpaca Elina 65B 4bit2K / 33.5 GB17957
...65B Gpt4 1.2 4bit 32g Actorder2K / 38.5 GB16670
....4.1 PI 8192 4bit 32g Actorder2K / 38.5 GB48
Enterredaas 65B 4bit 128g2K / 35.7 GB21
VicUnlocked Alpaca 65B 4bit2K / 35.7 GB86
Llama 65B 4bit2K / 34.7 GB112
LLaMA 65B 4bit 128g2K / 34.7 GB1216
Lite Oute 1 65M2K / 0.3 GB439
Note: green Score (e.g. "73.2") means that the model is better than Neko-Institute-of-Science/LLaMA-65B-4bit-32g.

Rank the LLaMA 65B 4bit 32g Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51352 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124