SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 301350 of 369 papers

TitleStatusHype
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization0
AdamsFormer for Spatial Action Localization in the Future0
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations0
Three Branches: Detecting Actions With Richer Features0
Actor-Centric Relation Network0
Activity Graph Transformer for Temporal Action Localization0
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization0
Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization0
You Ought to Look Around: Precise, Large Span Action Detection0
Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 20210
Zero-shot Action Localization via the Confidence of Large Vision-Language Models0
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 20190
Action Unit Memory Network for Weakly Supervised Temporal Action Localization0
Tubelets: Unsupervised action proposals from spatiotemporal super-voxels0
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges0
Weakly Supervised Temporal Action Localization via Dual-Prior Collaborative Learning Guided by Multimodal Large Language Models0
ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization0
Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization0
Two-Stream Consensus Network: Submission to HACS Challenge 2021 Weakly-Supervised Learning Track0
Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms0
Action Shuffling for Weakly Supervised Temporal Localization0
What do I Annotate Next? An Empirical Study of Active Learning for Action Localization0
Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling0
Unsupervised Action Discovery and Localization in Videos0
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization0
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream0
Exploring Denoised Cross-Video Contrast for Weakly-Supervised Temporal Action Localization0
Exploring Feature Representation and Training strategies in Temporal Action Localization0
Exploring Frame Segmentation Networks for Temporal Action Localization0
Ego-Only: Egocentric Action Detection without Exocentric Transferring0
Exploring Stronger Feature for Temporal Action Localization0
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos0
Exploring Temporally Dynamic Data Augmentation for Video Recognition0
Exploring Temporal Preservation Networks for Precise Temporal Action Localization0
Unsupervised Action Localization Crop in Video Retargeting for 3D ConvNets0
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization0
Few-Shot Common Action Localization via Cross-Attentional Fusion of Context and Temporal Dynamics0
Egocentric Activity Recognition and Localization on a 3D Map0
Few-Shot Transformation of Common Actions into Time and Space0
Efficient Action Localization with Approximately Normalized Fisher Vectors0
Efficient Action Detection in Untrimmed Videos via Multi-Task Learning0
Divide and Conquer for Single-Frame Temporal Action Localization0
Action Sensitivity Learning for Temporal Action Localization0
Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization0
Distributed Adaptive Learning of Graph Signals0
What If We Do Not Have Multiple Videos of the Same Action? -- Video Action Localization Using Web Images0
VideoCapsuleNet: A Simplified Network for Action Detection0
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding0
Generic Tubelet Proposals for Action Localization0
Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization0
Show:102550
← PrevPage 7 of 8Next →

No leaderboard results yet.