SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 151200 of 369 papers

TitleStatusHype
LocATe: End-to-end Localization of Actions in 3D with Transformers0
Point3D: tracking actions as moving points with 3D CNNs0
OpenTAL: Towards Open Set Temporal Action LocalizationCode1
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge PropagationCode1
ActionFormer: Localizing Moments of Actions with TransformersCode2
When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in VlogsCode0
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos0
TVNet: Temporal Voting Network for Action LocalizationCode0
Everything at Once - Multi-Modal Fusion Transformer for Video RetrievalCode1
Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order ConsistencyCode1
Exploring Denoised Cross-Video Contrast for Weakly-Supervised Temporal Action Localization0
ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action LocalizationCode0
Temporal Action Proposal Generation with Background ConstraintCode1
Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity0
Contextualized Spatio-Temporal Contrastive Learning with Self-SupervisionCode0
Everything at Once -- Multi-modal Fusion Transformer for Video RetrievalCode1
Graph Convolutional Module for Temporal Action Localization in Videos0
Low-Fidelity Video Encoder Optimization for Temporal Action Localization0
Background-Click Supervision for Temporal Action LocalizationCode1
Unsupervised Action Localization Crop in Video Retargeting for 3D ConvNets0
Towards Active Vision for Action Localization with Reactive Control and Predictive LearningCode1
KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal Action LocalizationCode0
Diagnosing Errors in Video Relation DetectorsCode0
Few-Shot Temporal Action Localization with Query Adaptive TransformerCode1
You Ought to Look Around: Precise, Large Span Action Detection0
Temporal Action Localization with Global Segmentation Mask Transformers0
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text UnderstandingCode0
A Survey on Temporal Sentence Grounding in Videos0
Class Semantics-based Attention for Action Detection0
Foreground-Action Consistency Network for Weakly Supervised Temporal Action LocalizationCode1
Deep Motion Prior for Weakly-Supervised Temporal Action Localization0
Learning Action Completeness from Points for Weakly-supervised Temporal Action LocalizationCode1
Temporal Action Localization Using Gated Recurrent UnitsCode0
Video Contrastive Learning with Global ContextCode1
Cross-modal Consensus Network forWeakly Supervised Temporal Action LocalizationCode1
Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 20210
Enriching Local and Global Contexts for Temporal Action LocalizationCode1
Cross-modal Consensus Network for Weakly Supervised Temporal Action LocalizationCode1
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action LocalizationCode1
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos0
Exploring Stronger Feature for Temporal Action Localization0
Two-Stream Consensus Network: Submission to HACS Challenge 2021 Weakly-Supervised Learning Track0
Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling0
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations0
BABEL: Bodies, Action and Behavior with English LabelsCode1
Relation Modeling in Spatio-Temporal Action Localization0
Few-Shot Action Localization without Knowing BoundariesCode0
Temporal Action Proposal Generation with Transformers0
FineAction: A Fine-Grained Video Dataset for Temporal Action LocalizationCode1
Egocentric Activity Recognition and Localization on a 3D Map0
Show:102550
← PrevPage 4 of 8Next →

No leaderboard results yet.