SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 201210 of 1149 papers

TitleStatusHype
Action Scene Graphs for Long-Form Understanding of Egocentric VideosCode1
MECD+: Unlocking Event-Level Causal Graph Discovery for Video ReasoningCode1
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?Code1
Learning Temporally Latent Causal Processes from General Temporal DataCode1
Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video RepresentationCode1
Contrastive Masked Autoencoders for Self-Supervised Video HashingCode1
Learning the Predictability of the FutureCode1
Localizing Moments in Long Video Via Multimodal GuidanceCode1
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual ActionsCode1
Learning Salient Boundary Feature for Anchor-free Temporal Action LocalizationCode1
Show:102550
← PrevPage 21 of 115Next →

No leaderboard results yet.