SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action in those frames. These coordinates change as the object performing the action moves.
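To make the definition concrete, here is a minimal Python sketch of one way a localized action could be represented: a label plus one bounding box per frame, so the box can move as the actor moves. The names `ActionTube`, `Box`, and `temporal_iou` are hypothetical and chosen for illustration; they do not come from any paper listed on this page. The temporal intersection-over-union shown is a common way to compare a predicted frame span with a ground-truth span, used here only as an assumed example metric.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# (x_min, y_min, x_max, y_max) in pixels; an assumed box convention.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container: one box per frame in which the action occurs."""
    label: str
    boxes: Dict[int, Box] = field(default_factory=dict)  # frame index -> box

    @property
    def start_frame(self) -> int:
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        return max(self.boxes)


def temporal_iou(a: ActionTube, b: ActionTube) -> float:
    """Overlap of two frame spans divided by their union (inclusive frames)."""
    inter = min(a.end_frame, b.end_frame) - max(a.start_frame, b.start_frame) + 1
    if inter <= 0:
        return 0.0
    len_a = a.end_frame - a.start_frame + 1
    len_b = b.end_frame - b.start_frame + 1
    return inter / (len_a + len_b - inter)


# Prediction covers frames 10..19; the box shifts 2 px right per frame
# to mimic an actor moving across the image.
pred = ActionTube(
    "jump",
    {f: (100.0 + 2 * (f - 10), 50.0, 160.0 + 2 * (f - 10), 200.0)
     for f in range(10, 20)},
)
# Ground truth covers frames 15..24 with a static box.
gt = ActionTube("jump", {f: (110.0, 50.0, 170.0, 200.0) for f in range(15, 25)})

score = temporal_iou(pred, gt)  # 5 shared frames out of 15 in the union
```

A fully spatio-temporal comparison would also compare the per-frame boxes over the shared frames; this sketch covers only the temporal part.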

Papers

Showing 201-250 of 369 papers

Title | Status | Hype
Improving Action Localization by Progressive Cross-stream Cooperation | | 0
IMUVIE: Pickup Timeline Action Localization via Motion Movies | | 0
JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization | | 0
Learning Actionness via Long-range Temporal Order Verification | | 0
Actor-centered Representations for Action Localization in Streaming Videos | | 0
Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks | | 0
Learning Discriminative Motion Features Through Detection | | 0
Learning Higher-order Object Interactions for Keypoint-based Video Understanding | | 0
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization | | 0
Learning to track for spatio-temporal action localization | | 0
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization | | 0
Localizing Actions from Video Labels and Pseudo-Annotations | | 0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder | | 0
Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization | | 0
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction | | 0
Rethinking the Faster R-CNN Architecture for Temporal Action Localization | | 0
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization | | 0
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding | | 0
SALAD: Self-Assessment Learning for Action Detection | | 0
Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos | | 0
MM-SEAL: A Large-scale Video Dataset of Multi-person Multi-grained Spatio-temporally Action Localization | | 0
Self-supervised Multi-actor Social Activity Understanding in Streaming Videos | | 0
Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity | | 0
Semi-Supervised Pipe Video Temporal Defect Interval Localization | | 0
Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization | | 0
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition | | 0
Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions | | 0
Spatio-Temporal Action Localization in a Weakly Supervised Setting | | 0
Spatio-temporal Action Recognition: A Survey | | 0
Frequency Selective Augmentation for Video Representation Learning | | 0
Spatio-Temporal Instance Learning: Action Tubes from Class Supervision | | 0
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | | 0
Spot On: Action Localization from Pointly-Supervised Proposals | | 0
STAT: Towards Generalizable Temporal Action Localization | | 0
Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos | | 0
Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization | | 0
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization | | 0
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | | 0
Temporal Action Localization by Structured Maximal Sums | | 0
Temporal Action Localization using Long Short-Term Dependency | | 0
Temporal Action Localization with Global Segmentation Mask Transformers | | 0
Temporal Action Localization with Multi-temporal Scales | | 0
Temporal Action Localization With Pyramid of Score Distribution Features | | 0
Temporal Action Localization with Variance-Aware Networks | | 0
Temporal Action Proposal Generation with Transformers | | 0
Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017 | | 0
Temporal Fusion Network for Temporal Action Localization: Submission to ActivityNet Challenge 2020 (Task E) | | 0
Temporal Perceiving Video-Language Pre-training | | 0
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos | | 0
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations | | 0

No leaderboard results yet.