SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 9911000 of 1149 papers

TitleStatusHype
CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization0
Global Self-Attention Networks0
Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization0
Attention Is Not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion0
Understanding Action Sequences based on Video Captioning for Learning-from-Observation0
t-EVA: Time-Efficient t-SNE Video Annotation0
Can Temporal Information Help with Contrastive Self-Supervised Learning?0
Cycle-Contrast for Self-Supervised Video Representation Learning0
Co-attentional Transformers for Story-Based Video Understanding0
Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset0
Show:102550
← PrevPage 100 of 115Next →

No leaderboard results yet.