SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 871880 of 1149 papers

TitleStatusHype
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models0
Motion-Guided Masking for Spatiotemporal Representation Learning0
Motion Sensitive Contrastive Learning for Self-supervised Video Representation0
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies0
MovieNet: A Holistic Dataset for Movie Understanding0
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning0
MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding0
MRSN: Multi-Relation Support Network for Video Action Detection0
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language0
Multi-kernel learning of deep convolutional features for action recognition0
Show:102550
← PrevPage 88 of 115Next →

No leaderboard results yet.