InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Mar 22, 2024 | Action Classification, Action Recognition | Code Available: 7
InternVideo: General Video Foundation Models via Generative and Discriminative Learning | Dec 6, 2022 | Action Classification, Action Recognition | Code Available: 4
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | Mar 14, 2024 | Mamba, Moment Retrieval | Code Available: 3
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications | Jun 2, 2022 | Action Recognition, Sports Analytics | Code Available: 3
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Jul 5, 2024 | Action Recognition, Few-Shot Image Classification | Code Available: 2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Jun 26, 2024 | Action Localization, Moment Retrieval | Code Available: 2
Leveraging Temporal Contextualization for Video Action Recognition | Apr 15, 2024 | Action Recognition, Temporal Action Localization | Code Available: 2
Test-Time Zero-Shot Temporal Action Localization | Apr 8, 2024 | Action Localization, Language Modelling | Code Available: 2
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Apr 7, 2024 | Action Detection, Moment Queries | Code Available: 2
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames | Nov 28, 2023 | Action Detection, Temporal Action Localization | Code Available: 2
Temporal Action Localization with Enhanced Instant Discriminability | Sep 11, 2023 | Action Detection, Action Localization | Code Available: 2
NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023 | Jul 5, 2023 | Action Localization, Moment Queries | Code Available: 2
Perception Test: A Diagnostic Benchmark for Multimodal Video Models | May 23, 2023 | Diagnostic, Grounded Video Question Answering | Code Available: 2
On the Benefits of 3D Pose and Tracking for Human Action Recognition | Apr 3, 2023 | Action Recognition, Temporal Action Localization | Code Available: 2
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking | Mar 29, 2023 | Action Classification, Action Recognition | Code Available: 2
TriDet: Temporal Action Detection with Relative Boundary Modeling | Mar 13, 2023 | Action Detection, Temporal Action Localization | Code Available: 2
AIM: Adapting Image Models for Efficient Video Action Recognition | Feb 6, 2023 | Action Classification, Action Recognition | Code Available: 2
Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries Challenge | Nov 16, 2022 | Action Localization, Moment Queries | Code Available: 2
Structured Attention Composition for Temporal Action Localization | May 20, 2022 | Action Detection, Action Localization | Code Available: 2
ActionFormer: Localizing Moments of Actions with Transformers | Feb 16, 2022 | Action Localization, Action Recognition | Code Available: 2
Temporal Segment Networks for Action Recognition in Videos | May 8, 2017 | Action Classification, Action Recognition | Code Available: 2
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition | Aug 2, 2016 | Action Classification, Action Recognition | Code Available: 2
Zero-Shot Temporal Interaction Localization for Egocentric Videos | Jun 4, 2025 | Action Localization, Human-Object Interaction Detection | Code Available: 1
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos | Mar 9, 2025 | Action Localization, Boundary Detection | Code Available: 1
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses | Jan 31, 2025 | Action Localization, Action Recognition | Code Available: 1
Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Dec 12, 2024 | Action Classification, Action Localization | Code Available: 1
SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition | Oct 22, 2024 | Action Recognition, Autonomous Driving | Code Available: 1
Saliency-Guided DETR for Moment Retrieval and Highlight Detection | Oct 2, 2024 | Highlight Detection, Moment Retrieval | Code Available: 1
Fisher Information guided Purification against Backdoor Attacks | Sep 1, 2024 | Action Recognition, backdoor defense | Code Available: 1
Open-Vocabulary Action Localization with Iterative Visual Prompting | Aug 30, 2024 | Action Localization, Temporal Action Localization | Code Available: 1
Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization | Aug 25, 2024 | Action Localization, Temporal Action Localization | Code Available: 1
TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning | Aug 20, 2024 | Action Recognition, parameter-efficient fine-tuning | Code Available: 1
Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms | Aug 19, 2024 | Action Recognition, Mamba | Code Available: 1
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Aug 12, 2024 | Action Classification, Action Localization | Code Available: 1
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization | Aug 12, 2024 | Action Localization, Temporal Action Localization | Code Available: 1
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition | Aug 10, 2024 | Action Classification, Action Recognition | Code Available: 1
Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism | Jul 18, 2024 | Action Localization, Temporal Action Localization | Code Available: 1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Jul 17, 2024 | Action Detection, Action Localization | Code Available: 1
Augmented Neural Fine-Tuning for Efficient Backdoor Purification | Jul 14, 2024 | Action Recognition, Data Augmentation | Code Available: 1
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization | Jul 9, 2024 | Action Localization, Temporal Action Localization | Code Available: 1
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection | Jul 3, 2024 | Action Detection, Dynamic neural networks | Code Available: 1
Referring Atomic Video Action Recognition | Jul 2, 2024 | Action Localization, Action Recognition | Code Available: 1
Snakes and Ladders: Two Steps Up for VideoMamba | Jun 27, 2024 | Action Recognition, Mamba | Code Available: 1
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization | Apr 4, 2024 | Action Localization, audio-visual event localization | Code Available: 1
A Lie Group Approach to Riemannian Batch Normalization | Mar 17, 2024 | Action Recognition, EEG | Code Available: 1
Realigning Confidence with Temporal Saliency Information for Point-Level Weakly-Supervised Temporal Action Localization | Jan 1, 2024 | Action Localization, Temporal Action Localization | Code Available: 1
A Dense-Sparse Complementary Network for Human Action Recognition based on RGB and Skeleton Modalities | Dec 28, 2023 | Action Recognition, Action Recognition In Videos | Code Available: 1
Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach | Dec 21, 2023 | Action Localization, Classification | Code Available: 1
Generative Model-based Feature Knowledge Distillation for Action Recognition | Dec 14, 2023 | Action Detection, Action Recognition | Code Available: 1
EZ-CLIP: Efficient Zeroshot Video Action Recognition | Dec 13, 2023 | Action Recognition, GPU | Code Available: 1