Gradientai Llama 3 8B Instruct Gradient 1048K AWQ 4bit Smashed is an open-source language model by PrunaAI. Features: 8b LLM, VRAM: 5.8GB, Context: 1024K, Quantized, Instruction-Based, LLM Explorer Score: 0.13.
The smashed model has a measured inference speed, memory, or energy consumption which is less than 90% of the original base model. Results mentioning "first" are obtained after the first run of the model. "Sync" and "Async" metrics could be relevant depending on the use-case.
Training Details
Data Sources:
WikiText
Methodology:
compressed with AWQ
Hardware Used:
NVIDIA A100-PCIE-40GB
Input Output
Performance Tips:
Check efficiency gains directly in your use-cases. The first run might take more memory or be slower than subsequent runs due to CUDA overheads.
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 53151 in total.