SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action. These coordinates change from frame to frame as the object performing the action moves.
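As an illustration only, the output described above can be pictured as a per-frame track of boxes. The sketch below is a minimal, hypothetical data structure: the names `ActionTube`, `center_at`, and `displacement`, and the pixel box format, are assumptions made for this example and do not come from any paper listed here.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container for one localized action."""
    label: str
    boxes: Dict[int, Box]  # frame index -> box for that frame

    @property
    def start_frame(self) -> int:
        # Temporal start: first frame that has a box.
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        # Temporal end: last frame that has a box.
        return max(self.boxes)

    def center_at(self, frame: int) -> Tuple[float, float]:
        # Spatial location: the (x, y) center of the box in one frame.
        x0, y0, x1, y1 = self.boxes[frame]
        return ((x0 + x1) / 2.0, (y0 + y1) / 2.0)

    def displacement(self) -> Tuple[float, float]:
        # How far the box center moved between the start and end frames.
        sx, sy = self.center_at(self.start_frame)
        ex, ey = self.center_at(self.end_frame)
        return (ex - sx, ey - sy)


# Example: an actor moving right across three frames.
tube = ActionTube(
    label="jump",
    boxes={10: (0, 0, 10, 10), 11: (2, 0, 12, 10), 12: (4, 0, 14, 10)},
)
print(tube.start_frame, tube.end_frame, tube.displacement())
```

Keying the boxes by frame index keeps the temporal extent and the spatial track in one place, so the start frame, end frame, and moving coordinates can all be read from the same object.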

Papers

Showing 151-175 of 369 papers

Title | Status | Hype
LocATe: End-to-end Localization of Actions in 3D with Transformers | - | 0
Point3D: tracking actions as moving points with 3D CNNs | - | 0
OpenTAL: Towards Open Set Temporal Action Localization | Code | 1
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation | Code | 1
ActionFormer: Localizing Moments of Actions with Transformers | Code | 2
When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs | Code | 0
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos | - | 0
TVNet: Temporal Voting Network for Action Localization | Code | 0
Exploring Denoised Cross-Video Contrast for Weakly-Supervised Temporal Action Localization | - | 0
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval | Code | 1
Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order Consistency | Code | 1
ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization | Code | 0
Temporal Action Proposal Generation with Background Constraint | Code | 1
Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity | - | 0
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision | Code | 0
Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval | Code | 1
Graph Convolutional Module for Temporal Action Localization in Videos | - | 0
Low-Fidelity Video Encoder Optimization for Temporal Action Localization | - | 0
Background-Click Supervision for Temporal Action Localization | Code | 1
Unsupervised Action Localization Crop in Video Retargeting for 3D ConvNets | - | 0
Towards Active Vision for Action Localization with Reactive Control and Predictive Learning | Code | 1
KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal Action Localization | Code | 0
Diagnosing Errors in Video Relation Detectors | Code | 0
Few-Shot Temporal Action Localization with Query Adaptive Transformer | Code | 1
You Ought to Look Around: Precise, Large Span Action Detection | - | 0

No leaderboard results yet.