SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 101150 of 369 papers

TitleStatusHype
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer0
Boundary Uncertainty in a Single-Stage Temporal Action Localization Network0
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization0
Action Localization through Continual Predictive Learning0
Boundary-Aware Proposal Generation Method for Temporal Action Localization0
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream0
AdamsFormer for Spatial Action Localization in the Future0
Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes0
Egocentric Activity Recognition and Localization on a 3D Map0
Efficient Action Localization with Approximately Normalized Fisher Vectors0
Action Localization in Videos Through Context Walk0
A Better Baseline for AVA0
Low-Fidelity Video Encoder Optimization for Temporal Action Localization0
Ego-Only: Egocentric Action Detection without Exocentric Transferring0
Efficient Action Detection in Untrimmed Videos via Multi-Task Learning0
Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport0
Divide and Conquer for Single-Frame Temporal Action Localization0
Distributed Adaptive Learning of Graph Signals0
BLP -- Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization0
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization0
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization0
Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization0
BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin0
Localizing Unseen Activities in Video via Image Query0
Exploring Feature Representation and Training strategies in Temporal Action Localization0
Exploring Frame Segmentation Networks for Temporal Action Localization0
Activity Graph Transformer for Temporal Action Localization0
Beyond Caption To Narrative: Video Captioning With Multiple Sentences0
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks0
LocATe: End-to-end Localization of Actions in 3D with Transformers0
LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization0
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models0
Detecting Parts for Action Localization0
Localizing Actions from Video Labels and Pseudo-Annotations0
Density-Guided Label Smoothing for Temporal Localization of Driving Actions0
Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder0
Deep Motion Prior for Weakly-Supervised Temporal Action Localization0
DeepLocalization: Using change point detection for Temporal Action Localization0
Action Unit Memory Network for Weakly Supervised Temporal Action Localization0
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization0
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization0
Low Pass Filter for Anti-aliasing in Temporal Action Localization0
Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection0
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments0
DAP3D-Net: Where, What and How Actions Occur in Videos?0
AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos0
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model0
A Proposed Artificial intelligence Model for Real-Time Human Action Localization and Tracking0
Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization0
Show:102550
← PrevPage 3 of 8Next →

No leaderboard results yet.