| UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition | Jul 19, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games | Jul 12, 2021 | Action Classification, Activity Recognition | Code Available | 1 |
| Attention Bottlenecks for Multimodal Fusion | Jun 30, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| Video Swin Transformer | Jun 24, 2021 | Action Classification, Action Recognition | Code Available | 2 |
| VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning | Jun 21, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| TokenLearner: What Can 8 Learned Tokens Do for Images and Videos? | Jun 21, 2021 | Action Classification, Image Classification | Code Available | 1 |
| TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification | Jun 21, 2021 | Action Classification, Classification | Code Available | 0 |
| Proposal Relation Network for Temporal Action Detection | Jun 20, 2021 | Action Classification, Action Detection | Code Available | 1 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action Recognition, Action Classification | Code Available | 1 |
| Space-time Mixing Attention for Video Transformer | Jun 10, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Jun 9, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| CT-Net: Channel Tensorization Network for Video Classification | Jun 3, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action Classification, Video Recognition | Code Available | 1 |
| Distributed Learning with Strategic Users: A Repeated Game Approach | May 21, 2021 | Action Classification | Unverified | 0 |
| VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living | May 17, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action Classification, Dynamic Time Warping | Code Available | 1 |
| Unsupervised Visual Representation Learning by Tracking Patches in Video | May 6, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| VidTr: Video Transformer Without Convolutions | Apr 23, 2021 | Action Classification, Action Recognition | Unverified | 0 |
| VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text | Apr 22, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Multiscale Vision Transformers | Apr 22, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Temporal Query Networks for Fine-grained Video Understanding | Apr 19, 2021 | Action Classification, Action Recognition | Unverified | 0 |
| Adaptive Intermediate Representations for Video Understanding | Apr 14, 2021 | Action Classification, Optical Flow Estimation | Unverified | 0 |
| Object Priors for Classifying and Localizing Unseen Actions | Apr 10, 2021 | Action Classification, Action Localization | Code Available | 0 |
| Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning | Apr 6, 2021 | Action Classification, Action Detection | Unverified | 0 |
| TubeR: Tubelet Transformer for Video Action Detection | Apr 2, 2021 | Action Classification, Action Detection | Code Available | 1 |
| Contrastive Learning of Single-Cell Phenotypic Representations for Treatment Classification | Mar 30, 2021 | Action Classification, Classification | Unverified | 0 |
| Recognizing Actions in Videos from Unseen Viewpoints | Mar 30, 2021 | Action Classification, Action Recognition | Unverified | 0 |
| Busy-Quiet Video Disentangling for Video Classification | Mar 29, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| ViViT: A Video Vision Transformer | Mar 29, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization | Mar 28, 2021 | Action Classification, Action Localization | Unverified | 0 |
| An Image is Worth 16x16 Words, What is a Video Worth? | Mar 25, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| MoViNets: Mobile Video Networks for Efficient Video Recognition | Mar 21, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Revisiting ResNets: Improved Training and Scaling Strategies | Mar 13, 2021 | Action Classification, Document Image Classification | Code Available | 1 |
| Domain and View-point Agnostic Hand Action Recognition | Mar 3, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| Is Space-Time Attention All You Need for Video Understanding? | Feb 9, 2021 | Action Classification, Action Recognition | Code Available | 2 |
| Video Transformer Network | Feb 1, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| TCLR: Temporal Contrastive Learning for Video Representation | Jan 20, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Human Action Recognition Based on Multi-scale Feature Maps from Depth Video Sequences | Jan 19, 2021 | Action Classification, Action Recognition | Unverified | 0 |
| Watch Only Once: An End-to-End Video Action Detection Framework | Jan 1, 2021 | Action Classification, Action Detection | Unverified | 0 |
| TDN: Temporal Difference Networks for Efficient Action Recognition | Dec 18, 2020 | Action Classification, Action Recognition | Code Available | 1 |
| Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN | Dec 17, 2020 | Action Classification, Action Localization | Unverified | 0 |
| MVFNet: Multi-View Fusion Network for Efficient Video Recognition | Dec 13, 2020 | Action Classification, Action Recognition | Code Available | 1 |
| Hierarchical Human Action Classification with Network Pruning | Dec 7, 2020 | Action Classification, Classification | Unverified | 0 |
| Real-time Spatio-temporal Action Localization via Learning Motion Representation | Nov 30, 2020 | Action Classification, Action Localization | Unverified | 0 |
| Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps | Nov 26, 2020 | Action Classification, Action Recognition | Unverified | 0 |
| TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks | Nov 23, 2020 | Action Classification, Action Localization | Code Available | 1 |
| Boundary-sensitive Pre-training for Temporal Localization in Videos | Nov 21, 2020 | Action Classification, Classification | Code Available | 1 |
| 3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks | Nov 20, 2020 | Action Classification, Classification | Unverified | 0 |
| Improved Soccer Action Spotting using both Audio and Video Streams | Nov 9, 2020 | Action Classification, Action Spotting | Unverified | 0 |
| Mutual Modality Learning for Video Action Classification | Nov 4, 2020 | Action Classification, Action Recognition | Code Available | 1 |