Llama 2 70B GPTQ is an open-source language model by TheBloke. Features: 70b LLM, VRAM: 35.3GB, Context: 4K, License: llama2, Quantized, LLM Explorer Score: 0.1.
Assistant-like chat, GPTQ quantized for GPU inference
Limitations:
Testing conducted in English, outputs in other languages are out-of-scope
Considerations:
Compliance with Meta's Acceptable Use Policy
Additional Notes
Model architecture uses 4-bit quantized versions for different VRAM requirements and inference quality optimization; supported by AutoGPTQ
Training Details
Data Sources:
Publicly available online data, Publicly available instruction datasets, Over one million new human-annotated examples
Data Volume:
2 trillion tokens for pretraining
Methodology:
Auto-regressive language modeling with transformer architecture. Fine-tuned with supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF).
Context Length:
4096
Training Time:
Llama 2 70B required 1720320 GPU hours
Hardware Used:
Meta's Research Super Cluster, Production clusters, 3.3M GPU hours on A100-80GB GPUs
Model Architecture:
Auto-regressive transformer
Safety Evaluation
Methodologies:
Human evaluations, Internal benchmarks
Findings:
Outperformed open-source chat models on benchmarks, On par with closed-source models like ChatGPT for helpfulness and safety
Risk Categories:
Misinformation, Bias
Ethical Considerations:
Testing conducted in English and has not covered all scenarios; may produce inaccurate or biased outputs
Responsible Ai Considerations
Fairness:
Testing conducted indicates model may produce inaccurate, biased outputs
Transparency:
Safety testing and tuning should be performed for specific applications
Accountability:
Developers need to ensure safety before deploying applications
Mitigation Strategies:
Use safety testing and tuning tailored to specific applications
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Llama-2-70B-GPTQ.
Rank the Llama 2 70B GPTQ Capabilities
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53999 in total.