SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action in those frames. These coordinates change as the object performing the action moves.
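As a rough illustration of the output described above, the following Python sketch represents one localized action as a start frame, an end frame, and a per-frame bounding box, and compares two localizations by temporal intersection-over-union, a commonly used overlap measure for this task. The names `ActionTube`, `center_at`, and `temporal_iou`, the box layout, and the sample values are assumptions made for this example and do not come from any particular paper or library.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """One localized action: a frame interval plus a box for each frame."""
    label: str
    start_frame: int          # first frame of the action (inclusive)
    end_frame: int            # last frame of the action (inclusive)
    boxes: Dict[int, Box]     # frame index -> bounding box

    def center_at(self, frame: int) -> Tuple[float, float]:
        """Return the (x, y) center of the box at the given frame."""
        x0, y0, x1, y1 = self.boxes[frame]
        return ((x0 + x1) / 2.0, (y0 + y1) / 2.0)


def temporal_iou(a: ActionTube, b: ActionTube) -> float:
    """Overlap of two frame intervals divided by the length of their union."""
    inter = min(a.end_frame, b.end_frame) - max(a.start_frame, b.start_frame) + 1
    if inter <= 0:
        return 0.0
    len_a = a.end_frame - a.start_frame + 1
    len_b = b.end_frame - b.start_frame + 1
    return inter / (len_a + len_b - inter)


# Hypothetical prediction: the box shifts right as the actor moves.
pred = ActionTube(
    label="jump",
    start_frame=10,
    end_frame=12,
    boxes={
        10: (10.0, 20.0, 50.0, 80.0),
        11: (14.0, 20.0, 54.0, 80.0),
        12: (18.0, 20.0, 58.0, 80.0),
    },
)

# Hypothetical ground truth covering frames 11 to 14 (boxes omitted here).
gt = ActionTube(label="jump", start_frame=11, end_frame=14, boxes={})

print(pred.center_at(10))      # center of the first predicted box
print(temporal_iou(pred, gt))  # overlap of the two frame intervals
```

The box centers at frames 10 to 12 drift along the x axis, which mirrors the point that the coordinates follow the actor's displacement. Purely temporal methods, such as most of the papers listed below, would only produce the `start_frame` and `end_frame` part.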

Papers

Showing 101-150 of 369 papers

Title | Status | Hype
ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization | | 0
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges | | 0
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer | | 0
Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts | Code | 0
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization | | 0
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | | 0
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction | | 0
A Multimodal Dataset for Enhancing Industrial Task Monitoring and Engagement Prediction | Code | 0
Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport | | 0
Weakly Supervised Temporal Action Localization via Dual-Prior Collaborative Learning Guided by Multimodal Large Language Models | | 0
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | | 0
Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization | Code | 0
Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos | | 0
Multilevel semantic and adaptive actionness learning for weakly supervised temporal action localization | Code | 0
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization | | 0
IMUVIE: Pickup Timeline Action Localization via Motion Movies | | 0
Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks? | | 0
Zero-shot Action Localization via the Confidence of Large Vision-Language Models | | 0
Transformer with Controlled Attention for Synchronous Motion Captioning | Code | 0
Unified Framework with Consistency across Modalities for Human Activity Recognition | Code | 0
FMI-TAL: Few-shot Multiple Instances Temporal Action Localization by Probability Distribution Learning and Interval Cluster Refinement | Code | 0
Online Temporal Action Localization with Memory-Augmented Transformer | | 0
Semi-Supervised Pipe Video Temporal Defect Interval Localization | | 0
Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization | Code | 0
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | | 0
Open-Vocabulary Temporal Action Localization using Multimodal Guidance | | 0
Self-supervised Multi-actor Social Activity Understanding in Streaming Videos | | 0
ViTALS: Vision Transformer for Action Localization in Surgical Nephrectomy | | 0
STAT: Towards Generalizable Temporal Action Localization | | 0
DeepLocalization: Using change point detection for Temporal Action Localization | | 0
Weakly supervised temporal action localization with actionness-guided false positive suppression | Code | 0
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder | | 0
LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | | 0
PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization | | 0
Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes | | 0
BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin | | 0
Density-Guided Label Smoothing for Temporal Localization of Driving Actions | | 0
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model | | 0
Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization | Code | 0
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization | Code | 0
Visual Self-paced Iterative Learning for Unsupervised Temporal Action Localization | Code | 0
ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization | | 0
POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization | | 0
Guided Attention for Interpretable Motion Captioning | Code | 0
Proposal-based Temporal Action Localization with Point-level Supervision | | 0
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization | | 0
Boundary-Aware Proposal Generation Method for Temporal Action Localization | | 0
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | | 0
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding | | 0
Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization | | 0
Page 3 of 8

No leaderboard results yet.