SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 150 of 369 papers

TitleStatusHype
Temporal Action Localization with Enhanced Instant DiscriminabilityCode2
Structured Attention Composition for Temporal Action LocalizationCode2
NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023Code2
Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries ChallengeCode2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
Test-Time Zero-Shot Temporal Action LocalizationCode2
ActionFormer: Localizing Moments of Actions with TransformersCode2
Learning to Localize Actions from MomentsCode1
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video ClipsCode1
Localizing the Common Action Among a Few VideosCode1
Cross-modal Consensus Network forWeakly Supervised Temporal Action LocalizationCode1
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action LocalizationCode1
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo LabelsCode1
Learning Sparse 2D Temporal Adjacent Networks for Temporal Action LocalizationCode1
Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action LocalizationCode1
Multi-Granularity Hand Action DetectionCode1
Foreground-Action Consistency Network for Weakly Supervised Temporal Action LocalizationCode1
Enriching Local and Global Contexts for Temporal Action LocalizationCode1
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action LocalizationCode1
Few-Shot Temporal Action Localization with Query Adaptive TransformerCode1
FineAction: A Fine-Grained Video Dataset for Temporal Action LocalizationCode1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming VideosCode1
1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020Code1
Cross-modal Consensus Network for Weakly Supervised Temporal Action LocalizationCode1
DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action LocalizationCode1
HAT: History-Augmented Anchor Transformer for Online Temporal Action LocalizationCode1
Actor-Context-Actor Relation Network for Spatio-Temporal Action LocalizationCode1
HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability PropagationCode1
Learning Action Completeness from Points for Weakly-supervised Temporal Action LocalizationCode1
Learning Salient Boundary Feature for Anchor-free Temporal Action LocalizationCode1
End-to-End Learning of Visual Representations from Uncurated Instructional VideosCode1
Entity-aware and Motion-aware Transformers for Language-driven Action Localization in VideosCode1
A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action LocalizationCode1
Actionness Inconsistency-guided Contrastive Learning for Weakly-supervised Temporal Action LocalizationCode1
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action UnderstandingCode1
EgoTaskQA: Understanding Human Tasks in Egocentric VideosCode1
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)Code1
Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent MechanismCode1
Everything at Once - Multi-Modal Fusion Transformer for Video RetrievalCode1
ASTRA: An Action Spotting TRAnsformer for Soccer VideosCode1
Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic EventsCode1
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive LearningCode1
Convex Combination Consistency between Neighbors for Weakly-supervised Action LocalizationCode1
Bottom-Up Temporal Action Localization with Mutual RegularizationCode1
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual ActionsCode1
BABEL: Bodies, Action and Behavior with English LabelsCode1
Background-Click Supervision for Temporal Action LocalizationCode1
Weakly-supervised Temporal Action Localization by Uncertainty ModelingCode1
Background Suppression Network for Weakly-supervised Temporal Action LocalizationCode1
Boosting Weakly-Supervised Temporal Action Localization with Text InformationCode1
Show:102550
← PrevPage 1 of 8Next →

No leaderboard results yet.