SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 2650 of 369 papers

TitleStatusHype
Transformer with Controlled Attention for Synchronous Motion CaptioningCode0
Unified Framework with Consistency across Modalities for Human Activity RecognitionCode0
Open-Vocabulary Action Localization with Iterative Visual PromptingCode1
FMI-TAL: Few-shot Multiple Instances Temporal Action Localization by Probability Distribution Learning and Interval Cluster RefinementCode0
Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action LocalizationCode1
HAT: History-Augmented Anchor Transformer for Online Temporal Action LocalizationCode1
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action LocalizationCode1
Online Temporal Action Localization with Memory-Augmented Transformer0
Semi-Supervised Pipe Video Temporal Defect Interval Localization0
Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent MechanismCode1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming VideosCode1
Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action LocalizationCode0
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization0
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action LocalizationCode1
Referring Atomic Video Action RecognitionCode1
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
Open-Vocabulary Temporal Action Localization using Multimodal Guidance0
Self-supervised Multi-actor Social Activity Understanding in Streaming Videos0
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action UnderstandingCode1
ViTALS: Vision Transformer for Action Localization in Surgical Nephrectomy0
SFMViT: SlowFast Meet ViT in Chaotic WorldCode1
STAT: Towards Generalizable Temporal Action Localization0
DeepLocalization: Using change point detection for Temporal Action Localization0
Weakly supervised temporal action localization with actionness-guided false positive suppressionCode0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder0
Show:102550
← PrevPage 2 of 15Next →

No leaderboard results yet.