SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action. These coordinates change from frame to frame when the object performing the action moves.
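As a rough illustration of the output described above, the sketch below shows one possible way to represent a localized action: a label plus one bounding box per frame. The names `ActionTube`, `Box`, and `center_at`, the corner-based box format, and the sample values are assumptions made for this example; they are not taken from any specific paper or library in the list.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """One localized action: a label and a box for each frame it spans."""
    label: str
    boxes: Dict[int, Box]  # frame index -> bounding box

    @property
    def start_frame(self) -> int:
        # Temporal start: the first frame that has a box.
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        # Temporal end: the last frame that has a box.
        return max(self.boxes)

    def center_at(self, frame: int) -> Tuple[float, float]:
        # Spatial location in a given frame: the (x, y) center of its box.
        x_min, y_min, x_max, y_max = self.boxes[frame]
        return ((x_min + x_max) / 2, (y_min + y_max) / 2)


# Made-up example: the actor moves right, so the x coordinate shifts per frame.
tube = ActionTube(
    label="walking",
    boxes={
        10: (50.0, 40.0, 70.0, 130.0),
        11: (55.0, 40.0, 75.0, 130.0),
        12: (60.0, 40.0, 80.0, 130.0),
    },
)

print(tube.start_frame, tube.end_frame)
print([tube.center_at(f) for f in sorted(tube.boxes)])
```

Purely temporal localization methods would only need the start and end frames; the per-frame boxes matter for the spatio-temporal variant of the task.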

Papers

Showing 251-300 of 369 papers

Title | Status | Hype
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020 | - | 0
Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets | Code | 1
Weakly Supervised Temporal Action Localization with Segment-Level Labels | - | 0
1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020 | Code | 1
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization | Code | 1
Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E) | - | 0
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1) | Code | 1
Weakly-supervised Temporal Action Localization by Uncertainty Modeling | Code | 1
Learning Temporal Co-Attention Models for Unsupervised Video Action Localization | Code | 0
ActionBytes: Learning From Trimmed Videos to Localize Actions | - | 0
Visual-Textual Capsule Routing for Text-Based Video Segmentation | - | 0
Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning | Code | 0
Weakly-Supervised Action Localization by Generative Attention Modeling | Code | 1
Action Localization through Continual Predictive Learning | - | 0
A Novel Online Action Detection Framework from Untrimmed Video Streams | - | 0
SF-Net: Single-Frame Supervision for Temporal Action Localization | Code | 1
Bottom-Up Temporal Action Localization with Mutual Regularization | Code | 1
Weakly-Supervised Multi-Person Action Recognition in 360° Videos | - | 0
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks | - | 0
Weakly Supervised Temporal Action Localization Using Deep Metric Learning | Code | 1
End-to-End Learning of Visual Representations from Uncurated Instructional Videos | Code | 1
SoccerDB: A Large-Scale Database for Comprehensive Video Understanding | Code | 0
Video action detection by learning graph-based spatio-temporal interactions | Code | 0
Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization | Code | 1
Background Suppression Network for Weakly-supervised Temporal Action Localization | Code | 1
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization | Code | 0
A Proposed Artificial intelligence Model for Real-Time Human Action Localization and Tracking | - | 0
Temporal Action Localization using Long Short-Term Dependency | - | 0
Towards Train-Test Consistency for Semi-supervised Temporal Action Localization | - | 0
Human Action Sequence Classification | - | 0
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks | - | 0
Hierarchical Self-Attention Network for Action Localization in Videos | - | 0
Gaussian Temporal Awareness Networks for Action Localization | Code | 0
Graph Convolutional Networks for Temporal Action Localization | Code | 0
Deep Concept-wise Temporal Convolutional Networks for Action Localization | Code | 0
3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization | Code | 0
Weakly-supervised Action Localization with Background Modeling | - | 0
Three Branches: Detecting Actions With Richer Features | - | 0
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization | - | 0
Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos | - | 0
Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2 | - | 0
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization | - | 0
Localizing Unseen Activities in Video via Image Query | - | 0
vireoJD-MM at Activity Detection in Extended Videos | - | 0
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019 | - | 0
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips | Code | 1
Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization | Code | 0
Improving Action Localization by Progressive Cross-stream Cooperation | - | 0
Exploring Feature Representation and Training strategies in Temporal Action Localization | - | 0
Marginalized Average Attentional Network for Weakly-Supervised Learning | - | 0

No leaderboard results yet.