SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 10911100 of 1149 papers

TitleStatusHype
Spatio-Temporal Context for Action Detection0
Spatio-Temporal Crop Aggregation for Video Representation Learning0
Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos0
Spatio-Temporal Video Representation Learning for AI Based Video Playback Style Prediction0
Speeding Up Action Recognition Using Dynamic Accumulation of Residuals in Compressed Domain0
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos0
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions0
SPOT! Revisiting Video-Language Models for Event Understanding0
Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips0
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training0
Show:102550
← PrevPage 110 of 115Next →

No leaderboard results yet.