| TubeR: Tubelet Transformer for Video Action Detection | Apr 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Busy-Quiet Video Disentangling for Video Classification | Mar 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| ViViT: A Video Vision Transformer | Mar 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| An Image is Worth 16x16 Words, What is a Video Worth? | Mar 25, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MoViNets: Mobile Video Networks for Efficient Video Recognition | Mar 21, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Revisiting ResNets: Improved Training and Scaling Strategies | Mar 13, 2021 | Action ClassificationDocument Image Classification | CodeCode Available | 1 |
| TCLR: Temporal Contrastive Learning for Video Representation | Jan 20, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| TDN: Temporal Difference Networks for Efficient Action Recognition | Dec 18, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MVFNet: Multi-View Fusion Network for Efficient Video Recognition | Dec 13, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks | Nov 23, 2020 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Boundary-sensitive Pre-training for Temporal Localization in Videos | Nov 21, 2020 | Action ClassificationClassification | CodeCode Available | 1 |
| Mutual Modality Learning for Video Action Classification | Nov 4, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing | Sep 30, 2020 | Action ClassificationVideo Recognition | CodeCode Available | 1 |
| Memory-augmented Dense Predictive Coding for Video Representation Learning | Aug 3, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Jul 23, 2020 | Action ClassificationKeyword Spotting | CodeCode Available | 1 |
| MotionSqueeze: Neural Motion Feature Learning for Video Understanding | Jul 20, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Region-based Non-local Operation for Video Classification | Jul 17, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Alleviating Over-segmentation Errors by Detecting Action Boundaries | Jul 14, 2020 | Action ClassificationAction Segmentation | CodeCode Available | 1 |
| AViD Dataset: Anonymized Videos from Diverse Countries | Jul 10, 2020 | Action ClassificationAction Detection | CodeCode Available | 1 |
| VPN: Learning Video-Pose Embedding for Activities of Daily Living | Jul 6, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition | Jun 20, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Weakly-supervised Temporal Action Localization by Uncertainty Modeling | Jun 12, 2020 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Can Deep Learning Recognize Subtle Human Activities? | Mar 30, 2020 | Action ClassificationDeep Learning | CodeCode Available | 1 |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Mar 17, 2020 | Action ClassificationClassification | CodeCode Available | 1 |
| Infrared and 3D skeleton feature fusion for RGB-D action recognition | Feb 28, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Over-the-Air Adversarial Flickering Attacks against Video Recognition Networks | Feb 12, 2020 | Action ClassificationClassification | CodeCode Available | 1 |
| Learning Spatiotemporal Features via Video and Text Pair Discrimination | Jan 16, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison | Oct 24, 2019 | Action ClassificationBenchmarking | CodeCode Available | 1 |
| An Evaluation of Action Recognition Models on EPIC-Kitchens | Aug 2, 2019 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Large Scale Holistic Video Understanding | Apr 25, 2019 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment | Apr 8, 2019 | Action ClassificationAction Quality Assessment | CodeCode Available | 1 |
| High Quality Monocular Depth Estimation via Transfer Learning | Dec 31, 2018 | Action ClassificationDecoder | CodeCode Available | 1 |
| SlowFast Networks for Video Recognition | Dec 10, 2018 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Timeception for Complex Action Recognition | Dec 4, 2018 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| TSM: Temporal Shift Module for Efficient Video Understanding | Nov 20, 2018 | 3D Action RecognitionAction Classification | CodeCode Available | 1 |
| SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos | Apr 12, 2018 | Action ClassificationAction Detection | CodeCode Available | 1 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition | Nov 30, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Non-local Neural Networks | Nov 21, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| ConvNet Architecture Search for Spatiotemporal Feature Learning | Aug 16, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset | May 22, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| The Kinetics Human Action Video Dataset | May 19, 2017 | Action ClassificationGeneral Classification | CodeCode Available | 1 |
| Skeleton-based Action Recognition with Convolutional Neural Networks | Apr 25, 2017 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Visual Semantic Role Labeling | May 17, 2015 | 16kAction Classification | CodeCode Available | 1 |
| SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis | Jun 9, 2025 | Action ClassificationBenchmarking | —Unverified | 0 |
| From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos | Jun 5, 2025 | Action ClassificationComposed Video Retrieval (CoVR) | CodeCode Available | 0 |
| Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition | May 29, 2025 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | May 22, 2025 | Action ClassificationAutomatic Speech Recognition | CodeCode Available | 0 |
| Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes | May 21, 2025 | Action ClassificationPose Tracking | —Unverified | 0 |
| Domain Adaptation of VLM for Soccer Video Understanding | May 20, 2025 | Action ClassificationDomain Adaptation | —Unverified | 0 |
| OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition | Mar 30, 2025 | Action ClassificationAction Recognition | —Unverified | 0 |