SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 176200 of 369 papers

TitleStatusHype
DAP3D-Net: Where, What and How Actions Occur in Videos?0
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model0
Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN0
Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization0
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization0
Learning to track for spatio-temporal action localization0
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization0
Localizing Actions from Video Labels and Pseudo-Annotations0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder0
Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization0
Localizing Unseen Activities in Video via Image Query0
LocATe: End-to-end Localization of Actions in 3D with Transformers0
LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization0
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization0
Low-Fidelity Video Encoder Optimization for Temporal Action Localization0
Low Pass Filter for Anti-aliasing in Temporal Action Localization0
Towards Train-Test Consistency for Semi-supervised Temporal Action Localization0
Marginalized Average Attentional Network for Weakly-Supervised Learning0
Max-Margin Structured Output Regression for Spatio-Temporal Action Localization0
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues0
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset0
Modeling Spatio-Temporal Human Track Structure for Action Localization0
Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 20
A Better Baseline for AVA0
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries0
Show:102550
← PrevPage 8 of 15Next →

No leaderboard results yet.