| LLM Name | VisionReward Video | 
| Repository ๐ค | https://huggingface.co/THUDM/VisionReward-Video | 
| Model Size | 12.5b | 
| Required VRAM | 25.1 GB | 
| Updated | 2025-07-28 | 
| Maintainer | THUDM | 
| Model Files | |
| Supported Languages | en | 
| Model Architecture | CogVLMVideoForCausalLM | 
| License | other | 
| Context Length | 2048 | 
| Model Max Length | 2048 | 
| Transformers Version | 4.43.1 | 
| Tokenizer Class | PreTrainedTokenizerFast | 
| Vocabulary Size | 128256 | 
| Torch Data Type | bfloat16 | 
| Best Alternatives | Context / RAM | Downloads | Likes | 
|---|---|---|---|
| VisionReward Video | 2K / 25.1 GB | 899 | 6 | 
| Cogvlm2 Video Llama3 Chat | 2K / 25.1 GB | 1110 | 51 | 
| Cogvlm2 Video Llama3 Chat | 2K / 25.1 GB | 206 | 52 | 
| Cogvlm2 Video Llama3 Base | 2K / 25.1 GB | 68 | 1 | 
| Cogvlm2 Video Llama3 Base | 2K / 25.1 GB | 9 | 1 | 
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐