SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action. These coordinates change as the object performing the action moves.
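As a rough illustration of the output described above, the sketch below represents one localized action as a set of per-frame bounding boxes, from which the start and end frames follow. The names `ActionTube`, `temporal_iou`, and `center`, the box layout, and the sample numbers are all assumptions made for this example; they are not taken from any paper or library listed on this page.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """One localized action: a label plus one box per frame it spans."""
    label: str
    boxes: Dict[int, Box]  # frame index -> box in that frame

    @property
    def start_frame(self) -> int:
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        return max(self.boxes)


def center(box: Box) -> Tuple[float, float]:
    """Return the (x, y) center of a box."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def temporal_iou(a: ActionTube, b: ActionTube) -> float:
    """Intersection over union of the two frame ranges (inclusive ends)."""
    inter = min(a.end_frame, b.end_frame) - max(a.start_frame, b.start_frame) + 1
    inter = max(inter, 0)
    len_a = a.end_frame - a.start_frame + 1
    len_b = b.end_frame - b.start_frame + 1
    return inter / (len_a + len_b - inter)


# Made-up prediction: the actor drifts right by 2 pixels per frame,
# so the box coordinates change from frame to frame.
pred = ActionTube(
    label="running",
    boxes={f: (f * 2.0, 50.0, f * 2.0 + 40.0, 130.0) for f in range(10, 20)},
)
# Made-up ground truth covering frames 15 to 24 with a fixed box.
gt = ActionTube(
    label="running",
    boxes={f: (30.0, 50.0, 70.0, 130.0) for f in range(15, 25)},
)

print(pred.start_frame, pred.end_frame)   # temporal extent of the prediction
print(center(pred.boxes[pred.start_frame]), center(pred.boxes[pred.end_frame]))
print(temporal_iou(pred, gt))             # overlap of the two frame ranges
```

Temporal overlap is only one way to compare a prediction with an annotation; spatio-temporal evaluation would also compare the boxes in the frames the two tubes share.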

Papers

Showing 226–250 of 369 papers

| Title | Status | Hype |
| --- | --- | --- |
| Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition | | 0 |
| Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions | | 0 |
| Spatio-Temporal Action Localization in a Weakly Supervised Setting | | 0 |
| Spatio-temporal Action Recognition: A Survey | | 0 |
| Frequency Selective Augmentation for Video Representation Learning | | 0 |
| Spatio-Temporal Instance Learning: Action Tubes from Class Supervision | | 0 |
| Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | | 0 |
| Spot On: Action Localization from Pointly-Supervised Proposals | | 0 |
| STAT: Towards Generalizable Temporal Action Localization | | 0 |
| Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos | | 0 |
| Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization | | 0 |
| Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization | | 0 |
| Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | | 0 |
| Temporal Action Localization by Structured Maximal Sums | | 0 |
| Temporal Action Localization using Long Short-Term Dependency | | 0 |
| Temporal Action Localization with Global Segmentation Mask Transformers | | 0 |
| Temporal Action Localization with Multi-temporal Scales | | 0 |
| Temporal Action Localization With Pyramid of Score Distribution Features | | 0 |
| Temporal Action Localization with Variance-Aware Networks | | 0 |
| Temporal Action Proposal Generation with Transformers | | 0 |
| Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017 | | 0 |
| Temporal Fusion Network for Temporal Action Localization: Submission to ActivityNet Challenge 2020 (Task E) | | 0 |
| Temporal Perceiving Video-Language Pre-training | | 0 |
| P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos | | 0 |
| Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations | | 0 |
Page 10 of 15

No leaderboard results yet.