SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 991–1000 of 1149 papers

| Title | Status | Hype |
|---|---|---|
| Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training | | 0 |
| Video Moment Localization using Object Evidence and Reverse Captioning | Code | 1 |
| Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization | Code | 1 |
| Video Understanding as Machine Translation | | 0 |
| Large Scale Video Representation Learning via Relational Graph Clustering | | 0 |
| Screencast Tutorial Video Understanding | Code | 0 |
| Temporal Aggregate Representations for Long-Range Video Understanding | Code | 1 |
| CARPe Posterum: A Convolutional Approach for Real-time Pedestrian Path Prediction | Code | 0 |
| DramaQA: Character-Centered Video Story Understanding with Hierarchical QA | Code | 0 |
| CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning | | 0 |

No leaderboard results yet.