SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 1120 of 369 papers

TitleStatusHype
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues0
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and GlassesCode1
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction0
A Multimodal Dataset for Enhancing Industrial Task Monitoring and Engagement PredictionCode0
Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport0
Weakly Supervised Temporal Action Localization via Dual-Prior Collaborative Learning Guided by Multimodal Large Language Models0
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments0
Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action LocalizationCode0
Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos0
Temporal Action Localization with Cross Layer Task Decoupling and RefinementCode1
Show:102550
← PrevPage 2 of 37Next →

No leaderboard results yet.