InternVideo2: Scaling Foundation Models for Multimodal Video Understanding Mar 22, 2024 Action Classification Action Recognition
Code Code Available 75 InternVideo: General Video Foundation Models via Generative and Discriminative Learning Dec 6, 2022 Action Classification Action Recognition
Code Code Available 45 Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Mar 14, 2024 Mamba Moment Retrieval
Code Code Available 35 A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications Jun 2, 2022 Action Recognition Sports Analytics
Code Code Available 35 Structured Attention Composition for Temporal Action Localization May 20, 2022 Action Detection Action Localization
Code Code Available 25 Leveraging Temporal Contextualization for Video Action Recognition Apr 15, 2024 Action Recognition Temporal Action Localization
Code Code Available 25 Temporal Segment Networks for Action Recognition in Videos May 8, 2017 Action Classification Action Recognition
Code Code Available 25 ActionFormer: Localizing Moments of Actions with Transformers Feb 16, 2022 Action Localization Action Recognition
Code Code Available 25 TriDet: Temporal Action Detection with Relative Boundary Modeling Mar 13, 2023 Action Detection Temporal Action Localization
Code Code Available 25 UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection Apr 7, 2024 Action Detection Moment Queries
Code Code Available 25 Test-Time Zero-Shot Temporal Action Localization Apr 8, 2024 Action Localization Language Modelling
Code Code Available 25 NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023 Jul 5, 2023 Action Localization Moment Queries
Code Code Available 25 End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames Nov 28, 2023 Action Detection Temporal Action Localization
Code Code Available 25 Perception Test: A Diagnostic Benchmark for Multimodal Video Models May 23, 2023 Diagnostic Grounded Video Question Answering
Code Code Available 25 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Jun 26, 2024 Action Localization Moment Retrieval
Code Code Available 25 On the Benefits of 3D Pose and Tracking for Human Action Recognition Apr 3, 2023 Action Recognition Temporal Action Localization
Code Code Available 25 Temporal Action Localization with Enhanced Instant Discriminability Sep 11, 2023 Action Detection Action Localization
Code Code Available 25 Temporal Segment Networks: Towards Good Practices for Deep Action Recognition Aug 2, 2016 Action Classification Action Recognition
Code Code Available 25 VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking Mar 29, 2023 Action Classification Action Recognition
Code Code Available 25 AIM: Adapting Image Models for Efficient Video Action Recognition Feb 6, 2023 Action Classification Action Recognition
Code Code Available 25 Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries Challenge Nov 16, 2022 Action Localization Moment Queries
Code Code Available 25 AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Jul 5, 2024 Action Recognition Few-Shot Image Classification
Code Code Available 25 Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization Jul 27, 2021 Action Localization Temporal Action Localization
Code Code Available 15 CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors Feb 7, 2022 Action Recognition Temporal Action Localization
Code Code Available 15 Bottom-Up Temporal Action Localization with Mutual Regularization Feb 18, 2020 Action Localization Temporal Action Localization
Code Code Available 15 A Closer Look at Spatiotemporal Convolutions for Action Recognition Nov 30, 2017 Action Classification Action Recognition
Code Code Available 15 ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization Apr 7, 2021 Action Localization Temporal Action Localization
Code Code Available 15 A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition Mar 23, 2023 Action Recognition Domain Adaptation
Code Code Available 15 A Lie Group Approach to Riemannian Batch Normalization Mar 17, 2024 Action Recognition EEG
Code Code Available 15 A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization Jan 3, 2021 Action Localization Hard Attention
Code Code Available 15 Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization May 1, 2022 Action Localization Data Augmentation
Code Code Available 15 Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts May 30, 2020 Action Recognition Temporal Action Localization
Code Code Available 15 Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art Nov 21, 2023 Action Recognition Skeleton Based Action Recognition
Code Code Available 15 Compressing Recurrent Neural Networks with Tensor Ring for Action Recognition Nov 19, 2018 Action Recognition Temporal Action Localization
Code Code Available 15 Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation Apr 17, 2018 Action Recognition RF-based Pose Estimation
Code Code Available 15 DCAN: Improving Temporal Action Detection via Dual Context Aggregation Dec 7, 2021 Action Detection Temporal Action Localization
Code Code Available 15 BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation Sep 15, 2020 Action Localization Relation
Code Code Available 15 Boundary-sensitive Pre-training for Temporal Localization in Videos Nov 21, 2020 Action Classification Classification
Code Code Available 15 CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks Mar 28, 2020 3D Medical Imaging Segmentation Action Recognition
Code Code Available 15 ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos Jul 17, 2024 Action Detection Action Localization
Code Code Available 15 Action Transformer: A Self-Attention Model for Short-Time Pose-Based Human Action Recognition Jul 1, 2021 Action Recognition Temporal Action Localization
Code Code Available 15 ActionCLIP: A New Paradigm for Video Action Recognition Sep 17, 2021 Action Classification Action Recognition
Code Code Available 15 Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization Jun 14, 2020 Action Detection Action Localization
Code Code Available 15 Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection Nov 28, 2023 Contrastive Learning Highlight Detection
Code Code Available 15 BSN: Boundary Sensitive Network for Temporal Action Proposal Generation Jun 8, 2018 Action Detection Temporal Action Localization
Code Code Available 15 BMN: Boundary-Matching Network for Temporal Action Proposal Generation Jul 23, 2019 Action Detection Action Recognition
Code Code Available 15 Background Suppression Network for Weakly-supervised Temporal Action Localization Nov 22, 2019 Action Localization Temporal Action Localization
Code Code Available 15 CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning Mar 30, 2021 Action Localization CoLA
Code Code Available 15 Weakly-supervised Temporal Action Localization by Uncertainty Modeling Jun 12, 2020 Action Classification Action Localization
Code Code Available 15