SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action. These coordinates change as the object performing the action is displaced.
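As a rough illustration of the output described above, the sketch below stores one localized action as a start frame, an end frame, and one bounding box per frame. The `ActionTube` class, its field names, and the `(x_min, y_min, x_max, y_max)` box layout are assumptions made for this example and do not come from any particular model or library.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container for one localized action."""
    label: str
    start_frame: int          # first frame in which the action is visible
    end_frame: int            # last frame of the action (inclusive)
    boxes: Dict[int, Box]     # frame index -> bounding box in that frame

    def center(self, frame: int) -> Tuple[float, float]:
        """Return the (x, y) center of the box at the given frame."""
        x0, y0, x1, y1 = self.boxes[frame]
        return ((x0 + x1) / 2.0, (y0 + y1) / 2.0)

    def displacement(self) -> Tuple[float, float]:
        """Return how far the box center moved between start and end frame."""
        sx, sy = self.center(self.start_frame)
        ex, ey = self.center(self.end_frame)
        return (ex - sx, ey - sy)


# Made-up values, for illustration only.
tube = ActionTube(
    label="running",
    start_frame=10,
    end_frame=12,
    boxes={
        10: (0.0, 0.0, 10.0, 20.0),
        11: (5.0, 0.0, 15.0, 20.0),
        12: (10.0, 2.0, 20.0, 22.0),
    },
)
print(tube.center(10))      # (5.0, 10.0)
print(tube.displacement())  # (10.0, 2.0)
```

The temporal part of the result is the `start_frame` and `end_frame` pair. The spatial part is the per-frame box, whose center shifts as the actor moves.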

Papers

Showing 201-225 of 369 papers

Title | Status | Hype
Improving Action Localization by Progressive Cross-stream Cooperation | | 0
IMUVIE: Pickup Timeline Action Localization via Motion Movies | | 0
JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization | | 0
Learning Actionness via Long-range Temporal Order Verification | | 0
Actor-centered Representations for Action Localization in Streaming Videos | | 0
Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks | | 0
Learning Discriminative Motion Features Through Detection | | 0
Learning Higher-order Object Interactions for Keypoint-based Video Understanding | | 0
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization | | 0
Learning to track for spatio-temporal action localization | | 0
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization | | 0
Localizing Actions from Video Labels and Pseudo-Annotations | | 0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder | | 0
Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization | | 0
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction | | 0
Rethinking the Faster R-CNN Architecture for Temporal Action Localization | | 0
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization | | 0
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding | | 0
SALAD: Self-Assessment Learning for Action Detection | | 0
Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos | | 0
MM-SEAL: A Large-scale Video Dataset of Multi-person Multi-grained Spatio-temporally Action Localization | | 0
Self-supervised Multi-actor Social Activity Understanding in Streaming Videos | | 0
Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity | | 0
Semi-Supervised Pipe Video Temporal Defect Interval Localization | | 0
Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization | | 0

No leaderboard results yet.