Flow Judge V0.1 is an open-source language model by flowaicom. Features: 3.8b LLM, VRAM: 7.7GB, Context: 128K, License: apache-2.0, Instruction-Based, LLM Explorer Score: 0.18.
Customizable evaluations for LLM systems using specific rubrics., Scoring scales offering binary, 3-point, and 5-point Likert scales for evaluation.
Limitations:
Multilingual capability not evaluated., Handling of long-context or structured input not addressed., Limited in specialized domains such as arithmetic and coding evaluations., Struggles with domain-specific knowledge outside training scope.
Training Details
Methodology:
Supervised Fine-Tuning (SFT) with RSLoRa.
Context Length:
8192
Model Architecture:
Phi-3.5-mini architecture, customized and fine-tuned for evaluative tasks
Input Output
Input Format:
Specific prompt structures with evaluation criteria, rubrics, and task outputs
Note: green Score (e.g. "73.2") means that the model is better than flowaicom/Flow-Judge-v0.1.
Rank the Flow Judge V0.1 Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52721 in total.