SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 541-550 of 1149 papers

Title | Status | Hype
Learning Object State Changes in Videos: An Open-World Perspective | | 0
Learning Higher-order Object Interactions for Keypoint-based Video Understanding | | 0
Learning from Multiple Sources for Video Summarisation | | 0
DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding | | 0
BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset Using Micro QR Codes | | 0
An Attempt towards Interpretable Audio-Visual Video Captioning | | 0
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | | 0
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment | | 0
Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking | | 0
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding | | 0

No leaderboard results yet.