SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action in those frames. These coordinates change from frame to frame when the object performing the action is displaced.
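To make the description above concrete, here is a minimal sketch of how one localized action could be represented: a label plus one bounding box per frame, from which the start frame, end frame, and per-frame x, y position follow. The `ActionTube` class, its field names, and the sample boxes are hypothetical and chosen only for illustration; they do not come from any particular model or dataset listed on this page.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container for one spatio-temporally localized action."""
    label: str
    boxes: Dict[int, Box] = field(default_factory=dict)  # frame index -> box

    @property
    def start_frame(self) -> int:
        # Temporal localization: first frame in which the action is present.
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        # Temporal localization: last frame in which the action is present.
        return max(self.boxes)

    def center(self, frame: int) -> Tuple[float, float]:
        # Spatial localization: x, y center of the box in the given frame.
        x0, y0, x1, y1 = self.boxes[frame]
        return ((x0 + x1) / 2, (y0 + y1) / 2)


# Made-up example: the actor moves right and up over three frames,
# so the x, y coordinates differ from frame to frame.
tube = ActionTube(
    label="jump",
    boxes={
        12: (100, 200, 140, 300),
        13: (110, 190, 150, 290),
        14: (120, 180, 160, 280),
    },
)
print(tube.start_frame, tube.end_frame)
print(tube.center(12), tube.center(14))
```

Storing one box per frame is only one possible design; it keeps the temporal extent and the frame-by-frame displacement in a single structure.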

Papers

Showing 91–100 of 369 papers

Title | Status | Hype
Bottom-Up Temporal Action Localization with Mutual Regularization | Code | 1
Weakly Supervised Temporal Action Localization Using Deep Metric Learning | Code | 1
End-to-End Learning of Visual Representations from Uncurated Instructional Videos | Code | 1
Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization | Code | 1
Background Suppression Network for Weakly-supervised Temporal Action Localization | Code | 1
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips | Code | 1
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions | Code | 1
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization | - | 0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization | - | 0
DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition | Code | 0

No leaderboard results yet.