| Model Type |  | 
| Use Cases | 
| Primary Use Cases: | | Custom LLM system evaluation tasks. | 
 |  | Limitations: | | Performance in specialized task domains like arithmetic or code evaluation may be limited., Support in non-English contexts has not been rigorously tested., Handling long context inputs or parsing structured data formats like JSON is limited. | 
 |  | 
| Additional Notes | | Model is quantized using Flow AI's AWQ safetensors quant. | 
 | 
| Supported Languages | | languages_supported (English), proficiency (Full support) | 
 | 
| Training Details | 
| Data Sources: | | Synthetically generated datasets | 
 |  | Methodology: | | Supervised Fine-Tuning (SFT), RSLoRa fine-tuning, synthetic dataset generation | 
 |  | Context Length: |  |  | Model Architecture: |  |  | 
| Input Output | 
| Input Format: | | Text prompt format with or without user inputs | 
 |  | Accepted Modalities: |  |  | Output Format: | | Structured evaluation outputs including verbal feedback with <tags> and numeric scores. | 
 |  | Performance Tips: | | Ensure modern GPU use and adherence to prompt templates for efficient evaluations. | 
 |  |