SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 151200 of 369 papers

TitleStatusHype
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models0
Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization0
Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling0
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition0
A Survey on Video Moment Localization0
Action Sensitivity Learning for Temporal Action Localization0
Learning Higher-order Object Interactions for Keypoint-based Video Understanding0
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization0
Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency ConstraintCode0
DeepSegmenter: Temporal Action Localization for Detecting Anomalies in Untrimmed Naturalistic Driving VideosCode0
Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection0
JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization0
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding0
Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling0
Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-FeatureCode0
Multi-modal Prompting for Low-Shot Temporal Action Localization0
Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization0
Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary GeneratorCode0
Temporal Perceiving Video-Language Pre-training0
Ego-Only: Egocentric Action Detection without Exocentric Transferring0
Anchor-free temporal action localization via Progressive Boundary-aware BoostingCode0
Few-Shot Common Action Localization via Cross-Attentional Fusion of Context and Temporal Dynamics0
Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization0
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization0
Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms0
Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action LocalizationCode0
Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action LocalizationCode0
AdamsFormer for Spatial Action Localization in the Future0
Boosting Positive Segments for Weakly-Supervised Audio-Visual Video ParsingCode0
Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization0
Dilation-Erosion for Single-Frame Supervised Temporal Action LocalizationCode0
Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization0
ReLER@ZJU Submission to the Ego4D Moment Queries Challenge 2022Code0
A Simple Transformer-Based Model for Ego4D Natural Language Queries ChallengeCode0
Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization TasksCode0
Prior-enhanced Temporal Action Localization using Subject-aware Spatial Attention0
Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization0
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream0
Adaptive Perception Transformer for Temporal Action Localization0
Temporal Action Localization with Multi-temporal Scales0
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in VideosCode0
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos0
HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers0
Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization0
MVP: Robust Multi-View Practice for Driving Action Localization0
Exploring Temporally Dynamic Data Augmentation for Video Recognition0
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization0
Weakly-Supervised Temporal Action Localization by Progressive Complementary LearningCode0
temporal driver action Localization using action classifications methodCode0
Contrastive Language-Action Pre-training for Temporal Localization0
Show:102550
← PrevPage 4 of 8Next →

No leaderboard results yet.