| 0-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event Camera | Jun 11, 2020 | Motion Compensation, Motion Segmentation | Code Available | 1 |
| TAM: Temporal Adaptive Module for Video Recognition | May 14, 2020 | Action Recognition, Video Recognition | Code Available | 1 |
| CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition | Apr 20, 2020 | Gesture Recognition, Lifelong learning | Code Available | 1 |
| Improved Residual Networks for Image and Video Recognition | Apr 10, 2020 | Action Recognition, image-classification | Code Available | 1 |
| Clean-Label Backdoor Attacks on Video Recognition Models | Mar 6, 2020 | Backdoor Attack, backdoor defense | Code Available | 1 |
| V4D:4D Convolutional Neural Networks for Video-level Representation Learning | Feb 18, 2020 | Long-range modeling, Representation Learning | Code Available | 1 |
| Over-the-Air Adversarial Flickering Attacks against Video Recognition Networks | Feb 12, 2020 | Action Classification, Classification | Code Available | 1 |
| Large Scale Holistic Video Understanding | Apr 25, 2019 | Action Classification, Action Recognition | Code Available | 1 |
| SlowFast Networks for Video Recognition | Dec 10, 2018 | Action Classification, Action Detection | Code Available | 1 |
| TSM: Temporal Shift Module for Efficient Video Understanding | Nov 20, 2018 | 3D Action Recognition, Action Classification | Code Available | 1 |
| Deep Feature Flow for Video Recognition | Nov 23, 2016 | Video Recognition, Video Semantic Segmentation | Code Available | 1 |
| Clockwork Convnets for Video Semantic Segmentation | Aug 11, 2016 | Image Segmentation, Scheduling | Code Available | 1 |
| DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Jul 16, 2025 | Benchmarking, Knowledge Distillation | Code Available | 0 |
| VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models | May 13, 2025 | Form, Multiple-choice | Code Available | 0 |
| Gameplay Highlights Generation | May 12, 2025 | Event Detection, Highlight Detection | Unverified | 0 |
| Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos | Apr 21, 2025 | Adversarial Robustness, Video Recognition | Unverified | 0 |
| CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition | Mar 30, 2025 | Action Classification, Action Recognition | Unverified | 0 |
| Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering | Mar 27, 2025 | Emotion Recognition, Question Answering | Unverified | 0 |
| VTD-CLIP: Video-to-Text Discretization via Prompting CLIP | Mar 24, 2025 | parameter-efficient fine-tuning, Video Recognition | Code Available | 0 |
| Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition | Mar 17, 2025 | Action Recognition, Video Recognition | Unverified | 0 |
| A Simple and Efficient Baseline for Video Action Recognition | Mar 2, 2025 | Action Recognition, Fine-grained Action Recognition | Unverified | 0 |
| VideoPure: Diffusion-based Adversarial Purification for Video Recognition | Jan 25, 2025 | Adversarial Defense, Adversarial Purification | Code Available | 0 |
| Action Detail Matters: Refining Video Recognition with Local Action Queries | Jan 1, 2025 | Action Recognition, Temporal Action Localization | Unverified | 0 |
| DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Dec 28, 2024 | Action Localization, Action Recognition | Unverified | 0 |
| Standardization Trends on Safety and Trustworthiness Technology for Advanced AI | Oct 29, 2024 | Video Recognition | Unverified | 0 |
| MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer | Oct 14, 2024 | Transfer Learning, Video Recognition | Code Available | 0 |
| A Novel Audio-Visual Information Fusion System for Mental Disorders Detection | Sep 3, 2024 | EEG, Video Recognition | Unverified | 0 |
| GenRec: Unifying Video Generation and Recognition with Diffusion Models | Aug 27, 2024 | Image to Video Generation, Video Generation | Code Available | 0 |
| Purification Of Contaminated Convolutional Neural Networks Via Robust Recovery: An Approach with Theoretical Guarantee in One-Hidden-Layer Case | Jul 4, 2024 | image-classification, Image Classification | Unverified | 0 |
| PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition | Jul 3, 2024 | Position, Video Recognition | Code Available | 0 |
| MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD | Jun 11, 2024 | Video Recognition, Video Understanding | Unverified | 0 |
| Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions | May 28, 2024 | Action Recognition, Video Recognition | Unverified | 0 |
| Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios | May 8, 2024 | Video Recognition | Unverified | 0 |
| Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | Apr 30, 2024 | Action Classification, Action Recognition | Unverified | 0 |
| LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model | Mar 18, 2024 | Adversarial Attack, Style Transfer | Unverified | 0 |
| Don't Judge by the Look: Towards Motion Coherent Video Representation | Mar 14, 2024 | Data Augmentation, Object Recognition | Code Available | 0 |
| Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition | Feb 29, 2024 | Transfer Learning, Video Recognition | Unverified | 0 |
| Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition | Jan 11, 2024 | Video Recognition | Code Available | 0 |
| HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition | Jan 10, 2024 | Action Recognition, Action Recognition In Videos | Code Available | 0 |
| Motion Guided Token Compression for Efficient Masked Video Modeling | Jan 10, 2024 | Video Compression, Video Recognition | Unverified | 0 |
| Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification | Jan 8, 2024 | Action Recognition, Contrastive Learning | Unverified | 0 |
| Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition | Dec 18, 2023 | Video Recognition | Code Available | 0 |
| LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer | Dec 15, 2023 | reinforcement-learning, Reinforcement Learning | Code Available | 0 |
| Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video Recognition | Nov 10, 2023 | Video Recognition | Code Available | 0 |
| Object-centric Video Representation for Long-term Action Anticipation | Oct 31, 2023 | Action Anticipation, Human-Object Interaction Detection | Code Available | 0 |
| On the Relevance of Temporal Features for Medical Ultrasound Video Recognition | Oct 16, 2023 | Video Recognition | Code Available | 0 |
| Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer | Sep 11, 2023 | Multi-Task Learning, Video Recognition | Unverified | 0 |
| Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving | Sep 8, 2023 | All, Autonomous Driving | Unverified | 0 |
| Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition | Aug 22, 2023 | Multiview Learning, Video Recognition | Code Available | 0 |
| Temporal-Distributed Backdoor Attack Against Video Based Action Recognition | Aug 21, 2023 | Action Recognition, Backdoor Attack | Unverified | 0 |