Chat Opt 350M Reward Deepspeed is an open-source language model by AdamG012. Features: 350m LLM, VRAM: 0.7GB, Context: 2K, License: apache-2.0, Instruction-Based, LLM Explorer Score: 0.07.
The training process involves a three-step pipeline including supervised fine tuning, reward model fine tuning, and reinforcement learning from human feedback (RLHF).
Context Length:
2048
Model Architecture:
OPT with 350M parameters, FFN dimensions 4096, Hidden size 1024, Attention heads 16, and Hidden layers 24.
Note: green Score (e.g. "73.2") means that the model is better than AdamG012/chat-opt-350m-reward-deepspeed.
Rank the Chat Opt 350M Reward Deepspeed Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52473 in total.