SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 961–970 of 1149 papers

| Title | Status | Hype |
|---|---|---|
| TCLR: Temporal Contrastive Learning for Video Representation | Code | 1 |
| TrackFormer: Multi-Object Tracking with Transformers | Code | 1 |
| CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization | | 0 |
| Attention Is Not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion | | 0 |
| Global Self-Attention Networks | | 0 |
| Learning Self-Similarity in Space and Time as a Generalized Motion for Action Recognition | Code | 1 |
| Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization | | 0 |
| A Comprehensive Study of Deep Video Action Recognition | Code | 1 |
| Understanding Action Sequences based on Video Captioning for Learning-from-Observation | | 0 |
| End-to-End Video Instance Segmentation with Transformers | Code | 1 |

No leaderboard results yet.