SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 171180 of 1149 papers

TitleStatusHype
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action DetectionCode1
Long Movie Clip Classification with State-Space Video ModelsCode1
AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual SegmentationCode1
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video UnderstandingCode1
Localizing Moments in Long Video Via Multimodal GuidanceCode1
Lightweight Network Architecture for Real-Time Action RecognitionCode1
Leveraging triplet loss for unsupervised action segmentationCode1
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual ActionsCode1
AutoVideo: An Automated Video Action Recognition SystemCode1
Learning Temporally Latent Causal Processes from General Temporal DataCode1
Show:102550
← PrevPage 18 of 115Next →

No leaderboard results yet.