SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action within those frames. These coordinates change from frame to frame as the object performing the action moves.
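As a rough illustration of the description above, the output of such a model can be pictured as a labeled span of frames with one bounding box per frame. The sketch below is only an assumed representation: the `ActionTube` class, its field names, and the `(x1, y1, x2, y2)` box layout are hypothetical and not taken from any particular paper or library.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical box layout: (x1, y1, x2, y2) in pixel coordinates.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Illustrative container for one localized action (assumed format)."""
    label: str
    start_frame: int        # first frame in which the action is visible
    end_frame: int          # last frame in which the action is visible (inclusive)
    boxes: Dict[int, Box]   # frame index -> bounding box in that frame

    def num_frames(self) -> int:
        """Temporal extent of the action, in frames."""
        return self.end_frame - self.start_frame + 1

    def center_at(self, frame: int) -> Tuple[float, float]:
        """Return the (x, y) center of the box in the given frame."""
        x1, y1, x2, y2 = self.boxes[frame]
        return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


# Toy example: the actor moves to the right, so the x coordinate shifts per frame.
tube = ActionTube(
    label="jump",
    start_frame=10,
    end_frame=12,
    boxes={
        10: (0.0, 0.0, 10.0, 10.0),
        11: (2.0, 0.0, 12.0, 10.0),
        12: (4.0, 0.0, 14.0, 10.0),
    },
)
print(tube.num_frames(), tube.center_at(10), tube.center_at(12))
```

Purely temporal action localization, the setting of most papers listed on this page, would keep only the label and the start and end frames and drop the per-frame boxes.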

Papers

Showing 51-100 of 369 papers

| Title | Status | Hype |
|---|---|---|
| Test-Time Zero-Shot Temporal Action Localization | Code | 2 |
| UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization | Code | 1 |
| ASTRA: An Action Spotting TRAnsformer for Soccer Videos | Code | 1 |
| LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | | 0 |
| PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization | | 0 |
| Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes | | 0 |
| BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin | | 0 |
| Density-Guided Label Smoothing for Temporal Localization of Driving Actions | | 0 |
| Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model | | 0 |
| Realigning Confidence with Temporal Saliency Information for Point-Level Weakly-Supervised Temporal Action Localization | Code | 1 |
| Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization | Code | 0 |
| Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach | Code | 1 |
| SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization | Code | 0 |
| Visual Self-paced Iterative Learning for Unsupervised Temporal Action Localization | Code | 0 |
| Temporal Action Localization for Inertial-based Human Activity Recognition | Code | 1 |
| ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization | | 0 |
| GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation | Code | 1 |
| POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization | | 0 |
| Guided Attention for Interpretable Motion Captioning | Code | 0 |
| Proposal-based Temporal Action Localization with Point-level Supervision | | 0 |
| Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization | | 0 |
| Boundary-Aware Proposal Generation Method for Temporal Action Localization | | 0 |
| Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | | 0 |
| Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding | | 0 |
| Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization | | 0 |
| Temporal Action Localization with Enhanced Instant Discriminability | Code | 2 |
| Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models | | 0 |
| Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization | | 0 |
| HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation | Code | 1 |
| Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling | | 0 |
| DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization | Code | 1 |
| NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023 | Code | 2 |
| Actionness Inconsistency-guided Contrastive Learning for Weakly-supervised Temporal Action Localization | Code | 1 |
| Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition | | 0 |
| Multi-Granularity Hand Action Detection | Code | 1 |
| A Survey on Video Moment Localization | | 0 |
| Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization | Code | 1 |
| Action Sensitivity Learning for Temporal Action Localization | | 0 |
| Learning Higher-order Object Interactions for Keypoint-based Video Understanding | | 0 |
| Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization | | 0 |
| Boosting Weakly-Supervised Temporal Action Localization with Text Information | Code | 1 |
| Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint | Code | 0 |
| Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels | Code | 1 |
| DeepSegmenter: Temporal Action Localization for Detecting Anomalies in Untrimmed Naturalistic Driving Videos | Code | 0 |
| WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition | Code | 1 |
| JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization | | 0 |
| Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection | | 0 |
| Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | | 0 |
| Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling | | 0 |
| Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature | Code | 0 |

No leaderboard results yet.