SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 251–260 of 1149 papers

| Title | Status | Hype |
| --- | --- | --- |
| Compositional Video Understanding with Spatiotemporal Structure-based Transformers | Code | 1 |
| EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video | Code | 1 |
| SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos | Code | 1 |
| Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties | Code | 1 |
| Learning Transferable Spatiotemporal Representations from Natural Script Knowledge | Code | 1 |
| Lightweight Network Architecture for Real-Time Action Recognition | Code | 1 |
| Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition | Code | 1 |
| Learning Temporally Causal Latent Processes from General Temporal Data | Code | 1 |
| Learning Salient Boundary Feature for Anchor-free Temporal Action Localization | Code | 1 |
| Learning Self-Similarity in Space and Time as a Generalized Motion for Action Recognition | Code | 1 |

No leaderboard results yet.