Meta Llama 3 8B Instruct Bnb 4bit by alokabhishek

 ยป  All LLMs  ยป  alokabhishek  ยป  Meta Llama 3 8B Instruct Bnb 4bit   URL Share it on

  Arxiv:2305.14314   4-bit   4bit   8b   Autotrain compatible   Bitsandbytes   Bnb   Conversational   Endpoints compatible   Facebook   Instruct   Llama   Llama-3   Meta   Quantized   Region:us   Safetensors   Sharded   Tensorflow

Meta Llama 3 8B Instruct Bnb 4bit Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Meta Llama 3 8B Instruct Bnb 4bit (alokabhishek/Meta-Llama-3-8B-Instruct-bnb-4bit)
๐ŸŒŸ Advertise your project ๐Ÿš€

Meta Llama 3 8B Instruct Bnb 4bit Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
commercial, research
Applications:
assistant-like chat
Primary Use Cases:
natural language generation tasks
Limitations:
Use in languages other than English, Designed for a broad range of applications and may not meet every developer safety preference out-of-the-box
Considerations:
Developers should tailor safety testing and tools according to their use cases
Additional Notes 
4-bit quantization increases accessibility.
Supported Languages 
English (Only)
Training Details 
Data Sources:
publicly available online data
Data Volume:
15 trillion tokens
Methodology:
supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)
Context Length:
8000
Hardware Used:
Meta's Research SuperCluster, third-party cloud compute
Model Architecture:
optimized transformer architecture
Safety Evaluation 
Methodologies:
adversarial evaluations
Findings:
significant reduction in false refusals compared to Llama 2
Risk Categories:
CBRNE, Cyber Security, Child Safety
Ethical Considerations:
Limited misuse and harm reported, developers encouraged to perform safety evaluations
Responsible Ai Considerations 
Fairness:
Commitment to reduce residual risks and focus on alignment and helpfulness
Transparency:
Benchmarking standards made transparent
Accountability:
Developers are responsible for ensuring the use is in line with the Llama 3 Community License and Acceptable Use Policy
Mitigation Strategies:
Steps to limit misuse, open source community tools provided
Input Output 
Input Format:
text
Accepted Modalities:
text
Output Format:
text and code generation
Performance Tips:
Use with transformers for optimal integration and use cases.
Release Notes 
Version:
Meta Llama 3
Date:
April 18, 2024
Notes:
Release of Meta Llama 3 family of models, including instruction-tuned and pretrained versions.
LLM NameMeta Llama 3 8B Instruct Bnb 4bit
Repository ๐Ÿค—https://huggingface.co/alokabhishek/Meta-Llama-3-8B-Instruct-bnb-4bit 
Base Model(s)  Meta Llama 3 8B Instruct 64K   NurtureAI/Meta-Llama-3-8B-Instruct-64k
Model Size8b
Required VRAM5.8 GB
Updated2025-09-22
Maintaineralokabhishek
Model Typellama
Instruction-BasedYes
Model Files  4.7 GB: 1-of-2   1.1 GB: 2-of-2
Quantization Type4bit
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length8192
Model Max Length8192
Transformers Version4.40.1
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size128256
Torch Data Typefloat16

Best Alternatives to Meta Llama 3 8B Instruct Bnb 4bit

Best Alternatives
Context / RAM
Downloads
Likes
...B Instruct Gradient 1048K 4bit1024K / 4.5 GB62
...B Instruct Gradient 1048K 8bit1024K / 8.6 GB61
...truct Gradient 1048K Bpw6 EXL21024K / 6.7 GB102
...truct Gradient 1048K Bpw5 EXL21024K / 5.8 GB60
Llama 3 8B Instruct 1048K 4bit1024K / 4.5 GB17225
Llama 3 8B Instruct 1048K 8bit1024K / 8.6 GB10117
... Gradient 1048K 8.0bpw H8 EXL21024K / 8.6 GB103
...ct Gradient 1048K Bpw2.25 EXL21024K / 3.4 GB91
Llama 3 8B Instruct 262K 2bit256K / 2.5 GB71
...B Instruct 262k V2 EXL2 5.0bpw256K / 5.8 GB61
Note: green Score (e.g. "73.2") means that the model is better than alokabhishek/Meta-Llama-3-8B-Instruct-bnb-4bit.

Rank the Meta Llama 3 8B Instruct Bnb 4bit Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51534 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124