SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 781790 of 1149 papers

TitleStatusHype
AE-Net:Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding0
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection0
Spotting Temporally Precise, Fine-Grained Events in VideoCode1
Clover: Towards A Unified Video-Language Alignment and Fusion ModelCode1
SVGraph: Learning Semantic Graphs from Instructional Videos0
Is Appearance Free Action Recognition Possible?Code1
Federated Self-supervised Learning for Video UnderstandingCode1
GraphVid: It Only Takes a Few Nodes to Understand a Video0
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering0
Multimodal Intent Discovery from Livestream Videos0
Show:102550
← PrevPage 79 of 115Next →

No leaderboard results yet.