SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 4150 of 369 papers

TitleStatusHype
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
Open-Vocabulary Temporal Action Localization using Multimodal Guidance0
Self-supervised Multi-actor Social Activity Understanding in Streaming Videos0
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action UnderstandingCode1
ViTALS: Vision Transformer for Action Localization in Surgical Nephrectomy0
SFMViT: SlowFast Meet ViT in Chaotic WorldCode1
STAT: Towards Generalizable Temporal Action Localization0
DeepLocalization: Using change point detection for Temporal Action Localization0
Weakly supervised temporal action localization with actionness-guided false positive suppressionCode0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder0
Show:102550
← PrevPage 5 of 37Next →

No leaderboard results yet.