SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 961970 of 1149 papers

TitleStatusHype
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning0
CogME: A Cognition-Inspired Multi-Dimensional Evaluation Metric for Story Understanding0
Spatio-Temporal Context for Action Detection0
Discerning Generic Event Boundaries in Long-Form Wild Videos0
Long-Short Temporal Contrastive Learning of Video Transformers0
C^3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues0
Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition0
Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking0
Transformed ROIs for Capturing Visual Transformations in Videos0
A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP0
Show:102550
← PrevPage 97 of 115Next →

No leaderboard results yet.