SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 861-870 of 1149 papers

Title | Status | Hype
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling | Code | 1
MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing | Code | 1
PyTorchVideo: A Deep Learning Library for Video Understanding | Code | 2
Fill-in-the-Blank: A Challenging Video Understanding Evaluation Framework | | 0
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge | | 0
Attention Mechanisms in Computer Vision: A Survey | Code | 2
Relational Self-Attention: What's Missing in Attention for Video Understanding | Code | 1
Revisiting spatio-temporal layouts for compositional action recognition | Code | 1
Re-ID-AR: Improved Person Re-identification in Video via Joint Weakly Supervised Action Recognition | Code | 0
Gradient Frequency Modulation for Visually Explaining Video Understanding Models | | 0

No leaderboard results yet.