SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 201210 of 369 papers

TitleStatusHype
Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization0
Multi-modal Prompting for Low-Shot Temporal Action Localization0
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization0
Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling0
Contrastive Language-Action Pre-training for Temporal Localization0
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 20200
MVP: Robust Multi-View Practice for Driving Action Localization0
Representation Learning on Visual-Symbolic Graphs for Video Understanding0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization0
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization0
Show:102550
← PrevPage 21 of 37Next →

No leaderboard results yet.