| Model Type | |
| Use Cases |
| Primary Use Cases: | | Custom LLM system evaluation tasks. |
|
| Limitations: | | Performance in specialized task domains like arithmetic or code evaluation may be limited., Support in non-English contexts has not been rigorously tested., Handling long context inputs or parsing structured data formats like JSON is limited. |
|
|
| Additional Notes | | Model is quantized using Flow AI's AWQ safetensors quant. |
|
| Supported Languages | | languages_supported (English), proficiency (Full support) |
|
| Training Details |
| Data Sources: | | Synthetically generated datasets |
|
| Methodology: | | Supervised Fine-Tuning (SFT), RSLoRa fine-tuning, synthetic dataset generation |
|
| Context Length: | |
| Model Architecture: | |
|
| Input Output |
| Input Format: | | Text prompt format with or without user inputs |
|
| Accepted Modalities: | |
| Output Format: | | Structured evaluation outputs including verbal feedback with <tags> and numeric scores. |
|
| Performance Tips: | | Ensure modern GPU use and adherence to prompt templates for efficient evaluations. |
|
|