SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action in those frames. These coordinates change from frame to frame as the object performing the action moves.
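As a rough illustration of the description above, one way to represent a localized action is as a set of per-frame bounding boxes, from which the start and end frames follow. The sketch below is only an assumption about how such output might be structured. The names `ActionTube`, `box_center`, and `temporal_iou` are hypothetical and do not come from any paper listed here. Temporal intersection-over-union is shown because it is a common way to compare a predicted time span with a ground-truth one.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """One localized action: a label plus one box per frame it spans."""
    label: str
    boxes: Dict[int, Box]  # frame index -> bounding box

    @property
    def start_frame(self) -> int:
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        return max(self.boxes)


def box_center(box: Box) -> Tuple[float, float]:
    """Return the (x, y) center of a box."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def temporal_iou(a: ActionTube, b: ActionTube) -> float:
    """Intersection-over-union of two frame spans (inclusive endpoints)."""
    inter = min(a.end_frame, b.end_frame) - max(a.start_frame, b.start_frame) + 1
    if inter <= 0:
        return 0.0
    len_a = a.end_frame - a.start_frame + 1
    len_b = b.end_frame - b.start_frame + 1
    return inter / (len_a + len_b - inter)


# Made-up example: the box shifts 2 pixels to the right on every frame,
# so the spatial coordinates change as the actor moves.
pred = ActionTube(
    label="run",
    boxes={f: (100.0 + 2 * (f - 10), 50.0, 140.0 + 2 * (f - 10), 130.0)
           for f in range(10, 20)},
)
gt = ActionTube(
    label="run",
    boxes={f: (110.0, 50.0, 150.0, 130.0) for f in range(15, 25)},
)

print(pred.start_frame, pred.end_frame)   # temporal extent of the prediction
print(box_center(pred.boxes[10]), box_center(pred.boxes[19]))
print(temporal_iou(pred, gt))             # 5 shared frames / 15 frames in the union
```

Keying the boxes by frame index keeps the temporal extent and the spatial coordinates in one structure, so the start and end frames never drift out of sync with the boxes.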

Papers

Showing 76–100 of 369 papers

| Title | Status | Hype |
| --- | --- | --- |
| Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval | Code | 1 |
| Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach | Code | 1 |
| Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order Consistency | Code | 1 |
| End-to-End Learning of Visual Representations from Uncurated Instructional Videos | Code | 1 |
| TALLFormer: Temporal Action Localization with a Long-memory Transformer | Code | 1 |
| ASTRA: An Action Spotting TRAnsformer for Soccer Videos | Code | 1 |
| Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization | Code | 1 |
| Few-Shot Temporal Action Localization with Query Adaptive Transformer | Code | 1 |
| Multi-Granularity Hand Action Detection | Code | 1 |
| Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization | Code | 1 |
| Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization | Code | 1 |
| Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Code | 1 |
| Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization | Code | 1 |
| Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets | Code | 1 |
| UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization | Code | 1 |
| AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions | Code | 1 |
| ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Code | 1 |
| DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization | Code | 1 |
| BABEL: Bodies, Action and Behavior with English Labels | Code | 1 |
| Background-Click Supervision for Temporal Action Localization | Code | 1 |
| 1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020 | Code | 1 |
| Weakly-supervised Temporal Action Localization by Uncertainty Modeling | Code | 1 |
| Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization | Code | 0 |
| Object Priors for Classifying and Localizing Unseen Actions | Code | 0 |
| ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization | Code | 0 |
Page 4 of 15

No leaderboard results yet.