Zephyr Orpo 141B A35b V0.1 Bpw2.25 is an open-source language model published by blockblockblock. Key specs: 141B-parameter MoE LLM, ~40 GB VRAM required, 64K context window, Apache 2.0 license, LLM Explorer Score: 0.13.
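The "Bpw2.25" suffix indicates an ExLlamaV2 quantization at 2.25 bits per weight, which is what fits a 141B-parameter model into roughly 40 GB of VRAM. A minimal loading sketch, assuming the exllamav2 Python package and a local download of the weights; the directory path and sampler settings are illustrative, and the exact API can differ between exllamav2 versions:

```python
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

# Local download of blockblockblock/zephyr-orpo-141b-A35b-v0.1-bpw2.25 (path assumed).
config = ExLlamaV2Config()
config.model_dir = "./zephyr-orpo-141b-A35b-v0.1-bpw2.25"
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # allocate the KV cache as layers load
model.load_autosplit(cache)               # split layers across available GPUs

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.7

print(generator.generate_simple(
    "Explain mixture-of-experts language models in one paragraph.",
    settings, num_tokens=200))
```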
Zephyr Orpo 141B A35b V0.1 Bpw2.25 Parameters and Internals
Model Type
Mixture of Experts (MoE)
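As a rough illustration of what "Mixture of Experts" means here: each token is routed to a small subset of expert feed-forward networks, so only a fraction of the total parameters (35B of 141B, per the architecture details below) is active per forward pass. The following is a hypothetical PyTorch sketch of top-k routing, not the model's actual implementation; all dimensions and names are toy values:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoELayer(nn.Module):
    """Hypothetical sparse MoE feed-forward layer with top-k routing."""
    def __init__(self, d_model=512, d_ff=2048, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):
        # x: (n_tokens, d_model). The router scores every expert per token,
        # but only the top-k experts actually run, so most parameters stay idle.
        gate_logits = self.router(x)                            # (n_tokens, n_experts)
        weights, expert_idx = gate_logits.topk(self.top_k, -1)  # pick k experts per token
        weights = F.softmax(weights, dim=-1)                    # renormalize over the chosen k
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = expert_idx[:, slot] == e                 # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out

# Toy usage: 10 tokens through the layer.
layer = TopKMoELayer()
print(layer(torch.randn(10, 512)).shape)  # torch.Size([10, 512])
```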
Use Cases
Areas:
research, commercial applications
Primary Use Cases:
chat, code, math, reasoning
Limitations:
The model can produce problematic outputs, especially when prompted to do so. The size and composition of the corpus used to train the base model are unknown.
Supported Languages
English (primary)
Training Details
Data Sources:
argilla/distilabel-capybara-dpo-7k-binarized (see the loading snippet after this section)
Data Volume:
7k instances
Methodology:
Odds Ratio Preference Optimization (ORPO); a sketch of the loss follows this section
Training Time:
1.3 hours
Hardware Used:
4 nodes of 8 × H100 GPUs each (32 GPUs in total)
Model Architecture:
Mixture of Experts (MoE) with 141B total parameters and 35B active parameters
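The preference dataset listed under Data Sources is hosted on the Hugging Face Hub and can be inspected directly. A quick sketch using the datasets library; the split name and expected columns are assumptions to verify against the dataset card:

```python
from datasets import load_dataset

# Split name assumed; check the dataset card for the exact splits and columns.
ds = load_dataset("argilla/distilabel-capybara-dpo-7k-binarized", split="train")
print(len(ds))          # roughly 7k preference instances, per the figure above
print(ds.column_names)  # expect chosen/rejected-style preference fields
```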
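To make the ORPO methodology concrete: ORPO augments the ordinary supervised fine-tuning loss with an odds-ratio penalty that pushes up the odds of the chosen response relative to the rejected one, using only the policy's own probabilities. A minimal sketch of the loss, assuming mean per-token log-probabilities for each response are already computed; the function name and the lambda value are illustrative:

```python
import torch
import torch.nn.functional as F

def orpo_loss(chosen_logps, rejected_logps, sft_nll, lam=0.1):
    """ORPO objective: SFT loss plus a lambda-weighted odds-ratio penalty.

    chosen_logps / rejected_logps: mean per-token log-probabilities of the
    chosen and rejected responses under the policy, shape (batch,).
    sft_nll: the usual next-token NLL on the chosen responses.
    lam: penalty weight (illustrative value; a tunable hyperparameter).
    """
    # log odds(y) = log p(y) - log(1 - p(y)); log1p(-exp(x)) is a stable log(1 - e^x).
    log_odds_chosen = chosen_logps - torch.log1p(-torch.exp(chosen_logps))
    log_odds_rejected = rejected_logps - torch.log1p(-torch.exp(rejected_logps))
    # Penalize low odds of the chosen response relative to the rejected one.
    or_term = -F.logsigmoid(log_odds_chosen - log_odds_rejected)
    return sft_nll + lam * or_term.mean()

# Toy usage with made-up log-probabilities.
chosen = torch.tensor([-0.4, -0.6])
rejected = torch.tensor([-1.2, -0.9])
print(orpo_loss(chosen, rejected, sft_nll=torch.tensor(0.5)))
```

Because the odds ratio is computed from the policy's own probabilities, no frozen reference model is needed, which keeps fine-tuning lightweight (here, 1.3 hours on the hardware listed above).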