SOTAVerified

Action Localization

Action localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action in those frames. These coordinates change as the object performing the action moves.
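To make the description above concrete, here is a minimal sketch of what a localized action might look like as data: a temporal extent (start and end frame) plus one bounding box per frame. The names `ActionTube`, `center_at`, and `temporal_iou`, the box convention, and the toy values are all assumptions made for illustration; they are not taken from any paper listed here. Temporal intersection over union is included only as one common way to compare a predicted extent with a reference one.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# (x_min, y_min, x_max, y_max) in pixels; a hypothetical box convention.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """One localized action: a temporal extent plus a box per frame."""
    label: str
    start_frame: int  # first frame in which the action occurs
    end_frame: int    # last frame in which the action occurs (inclusive)
    boxes: Dict[int, Box] = field(default_factory=dict)  # frame index -> box

    def center_at(self, frame: int) -> Tuple[float, float]:
        """Return the (x, y) center of the box stored for the given frame."""
        x_min, y_min, x_max, y_max = self.boxes[frame]
        return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def temporal_iou(a: ActionTube, b: ActionTube) -> float:
    """Intersection over union of two inclusive frame intervals."""
    inter = min(a.end_frame, b.end_frame) - max(a.start_frame, b.start_frame) + 1
    if inter <= 0:
        return 0.0
    len_a = a.end_frame - a.start_frame + 1
    len_b = b.end_frame - b.start_frame + 1
    return inter / (len_a + len_b - inter)


# Toy data (made up): the actor moves right, so the box shifts frame to frame.
pred = ActionTube(
    "jump", 10, 12,
    {10: (0, 0, 10, 20), 11: (5, 0, 15, 20), 12: (10, 0, 20, 20)},
)
truth = ActionTube("jump", 11, 14)
```

In this sketch, calling `pred.center_at(frame)` for successive frames shows the x coordinate drifting as the actor is displaced, while `temporal_iou(pred, truth)` measures how well the predicted start and end frames overlap the reference interval.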

Papers

Showing 151-200 of 369 papers

| Title | Status | Hype |
| --- | --- | --- |
| Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs | Code | 0 |
| Temporal Action Localization Using Gated Recurrent Units | Code | 0 |
| temporal driver action Localization using action classifications method | Code | 0 |
| Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images | Code | 0 |
| Towards Improving Spatiotemporal Action Recognition in Videos | Code | 0 |
| Transformer with Controlled Attention for Synchronous Motion Captioning | Code | 0 |
| TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals | Code | 0 |
| TVNet: Temporal Voting Network for Action Localization | Code | 0 |
| Unified Framework with Consistency across Modalities for Human Activity Recognition | Code | 0 |
| Visual Self-paced Iterative Learning for Unsupervised Temporal Action Localization | Code | 0 |
| VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding | Code | 0 |
| VideoLSTM Convolves, Attends and Flows for Action Recognition | Code | 0 |
| Weakly Supervised Action Localization by Sparse Temporal Pooling Network | Code | 0 |
| Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning | Code | 0 |
| Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning | Code | 0 |
| Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature | Code | 0 |
| Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint | Code | 0 |
| Weakly supervised temporal action localization with actionness-guided false positive suppression | Code | 0 |
| When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs | Code | 0 |
| You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization | Code | 0 |
| Actor-centered Representations for Action Localization in Streaming Videos | | 0 |
| vireoJD-MM at Activity Detection in Extended Videos | | 0 |
| Learning Discriminative Motion Features Through Detection | | 0 |
| Visual-Textual Capsule Routing for Text-Based Video Segmentation | | 0 |
| Learning Higher-order Object Interactions for Keypoint-based Video Understanding | | 0 |
| DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | | 0 |
| DAP3D-Net: Where, What and How Actions Occur in Videos? | | 0 |
| ViTALS: Vision Transformer for Action Localization in Surgical Nephrectomy | | 0 |
| Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model | | 0 |
| Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization | | 0 |
| Learning to track for spatio-temporal action localization | | 0 |
| LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization | | 0 |
| Localizing Actions from Video Labels and Pseudo-Annotations | | 0 |
| Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder | | 0 |
| Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization | | 0 |
| Localizing Unseen Activities in Video via Image Query | | 0 |
| LocATe: End-to-end Localization of Actions in 3D with Transformers | | 0 |
| LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | | 0 |
| Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization | | 0 |
| Low-Fidelity Video Encoder Optimization for Temporal Action Localization | | 0 |
| Low Pass Filter for Anti-aliasing in Temporal Action Localization | | 0 |
| Towards Train-Test Consistency for Semi-supervised Temporal Action Localization | | 0 |
| Marginalized Average Attentional Network for Weakly-Supervised Learning | | 0 |
| Max-Margin Structured Output Regression for Spatio-Temporal Action Localization | | 0 |
| Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | | 0 |
| Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization | | 0 |
| Modeling Spatio-Temporal Human Track Structure for Action Localization | | 0 |
| Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2 | | 0 |
| Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN | | 0 |
| Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries | | 0 |

No leaderboard results yet.