SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 126150 of 369 papers

TitleStatusHype
SimOn: A Simple Framework for Online Temporal Action LocalizationCode1
EgoTaskQA: Understanding Human Tasks in Egocentric VideosCode1
Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization0
Adaptive Perception Transformer for Temporal Action Localization0
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream0
Temporal Action Localization with Multi-temporal Scales0
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in VideosCode0
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos0
HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers0
Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization0
MVP: Robust Multi-View Practice for Driving Action Localization0
Exploring Temporally Dynamic Data Augmentation for Video Recognition0
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization0
Weakly-Supervised Temporal Action Localization by Progressive Complementary LearningCode0
temporal driver action Localization using action classifications methodCode0
Structured Attention Composition for Temporal Action LocalizationCode2
Entity-aware and Motion-aware Transformers for Language-driven Action Localization in VideosCode1
Convex Combination Consistency between Neighbors for Weakly-supervised Action LocalizationCode1
Contrastive Language-Action Pre-training for Temporal Localization0
E^2TAD: An Energy-Efficient Tracking-based Action DetectorCode1
Frequency Selective Augmentation for Video Representation Learning0
MM-SEAL: A Large-scale Video Dataset of Multi-person Multi-grained Spatio-temporally Action Localization0
TALLFormer: Temporal Action Localization with a Long-memory TransformerCode1
Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action LocalizationCode1
Unsupervised Pre-training for Temporal Action Localization TasksCode1
Show:102550
← PrevPage 6 of 15Next →

No leaderboard results yet.