SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 301310 of 1149 papers

TitleStatusHype
Spotting Temporally Precise, Fine-Grained Events in VideoCode1
Clover: Towards A Unified Video-Language Alignment and Fusion ModelCode1
Is Appearance Free Action Recognition Possible?Code1
Federated Self-supervised Learning for Video UnderstandingCode1
ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningCode1
REVECA -- Rich Encoder-decoder framework for Video Event CAptionerCode1
Stand-Alone Inter-Frame Attention in Video ModelsCode1
A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action DetectorCode1
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-AnsweringCode1
Free Lunch for Surgical Video Understanding by Distilling Self-SupervisionsCode1
Show:102550
← PrevPage 31 of 115Next →

No leaderboard results yet.