BigLlama 3.1 1T Instruct is an open-source language model by mlabonne. Features: 681B LLM, VRAM: 186.1GB, Context: 128K, Instruction-Based, Merged, LLM Explorer Score: 0.16.
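This is an experimental self-merged model. The merge is defined in a mergekit YAML configuration that uses the passthrough merge method, which stacks layer slices as they are rather than averaging weights. The model's parameter count was calculated with a Python script included in the model's notes.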
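The author's script is not reproduced on this page. As a rough illustration of how a parameter count for a Llama-style model can be derived from its shape, here is a minimal sketch. The function name `count_llama_params` is hypothetical, and the default values are assumptions based on the published Llama 3.1 405B configuration (hidden size 16384, MLP size 53248, 128 attention heads, 8 key/value heads, vocabulary 128256, untied embeddings). It is not the author's method.

```python
def count_llama_params(
    num_layers: int,
    hidden: int = 16384,        # assumed hidden size (Llama 3.1 405B)
    intermediate: int = 53248,  # assumed MLP inner size
    num_heads: int = 128,       # assumed number of attention heads
    num_kv_heads: int = 8,      # assumed number of key/value heads (GQA)
    vocab: int = 128256,        # assumed vocabulary size
) -> int:
    """Estimate the parameter count of a Llama-style decoder.

    Assumes no biases, RMSNorm weights of size `hidden`, and separate
    (untied) input embedding and output head matrices.
    """
    head_dim = hidden // num_heads
    kv_dim = num_kv_heads * head_dim

    # Attention: q_proj and o_proj are hidden x hidden,
    # k_proj and v_proj are hidden x kv_dim.
    attention = 2 * hidden * hidden + 2 * hidden * kv_dim
    # MLP: gate_proj, up_proj and down_proj.
    mlp = 3 * hidden * intermediate
    # Two RMSNorm weight vectors per layer.
    norms = 2 * hidden

    per_layer = attention + mlp + norms
    # Input embeddings, output head, and the final RMSNorm.
    non_layer = 2 * vocab * hidden + hidden
    return num_layers * per_layer + non_layer


if __name__ == "__main__":
    # 126 layers is the depth of the unmerged 405B base model.
    print(f"{count_llama_params(126) / 1e9:.2f}B parameters")
```

Because a passthrough merge only duplicates whole decoder layers, the total grows linearly with the merged layer count while the embedding and output head are counted once. Passing the merged model's layer count as `num_layers` gives the corresponding estimate.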
Training Details
Methodology:
Self-merge of Meta-Llama-3.1-405B-Instruct using mergekit's passthrough merge method. It follows the same approach as the earlier Meta-Llama-3-120B-Instruct self-merge.
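To make the passthrough idea concrete, the sketch below generates overlapping layer ranges from a single source model and arranges them in a dictionary that mirrors the layout of a mergekit configuration (`slices`, `sources`, `layer_range`, `merge_method`, `dtype`). The helper names, the window and stride values, and the model identifier in the example are hypothetical and chosen only for illustration. They are not the settings used for this model.

```python
import json


def make_slices(num_layers: int, window: int, stride: int) -> list[list[int]]:
    """Return overlapping [start, end) layer ranges covering all layers.

    Hypothetical helper: each slice spans `window` layers and the next
    slice starts `stride` layers later, so a stride smaller than the
    window duplicates layers in the merged model.
    """
    if not 0 < stride <= window <= num_layers:
        raise ValueError("need 0 < stride <= window <= num_layers")
    ranges = []
    start = 0
    while True:
        end = min(start + window, num_layers)
        ranges.append([start, end])
        if end == num_layers:
            break
        start += stride
    return ranges


def passthrough_config(model: str, num_layers: int, window: int, stride: int) -> dict:
    """Build a dict shaped like a mergekit passthrough configuration."""
    return {
        "slices": [
            {"sources": [{"model": model, "layer_range": r}]}
            for r in make_slices(num_layers, window, stride)
        ],
        "merge_method": "passthrough",
        "dtype": "bfloat16",
    }


def merged_layer_count(config: dict) -> int:
    """Total number of layers in the stacked (merged) model."""
    return sum(
        src["layer_range"][1] - src["layer_range"][0]
        for s in config["slices"]
        for src in s["sources"]
    )


if __name__ == "__main__":
    # Toy example: a 10-layer model, 4-layer windows, stride of 2.
    cfg = passthrough_config("example/base-model", 10, window=4, stride=2)
    print(json.dumps(cfg, indent=2))
    print("merged layers:", merged_layer_count(cfg))
```

In the toy example the 10 source layers become 16 merged layers, because each interior layer appears in two slices. The dictionary can be written out as YAML with any YAML library to obtain a configuration file.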