| UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition | Jul 19, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games | Jul 12, 2021 | Action ClassificationActivity Recognition | CodeCode Available | 1 |
| Attention Bottlenecks for Multimodal Fusion | Jun 30, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Video Swin Transformer | Jun 24, 2021 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification | Jun 21, 2021 | Action ClassificationClassification | CodeCode Available | 0 |
| VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning | Jun 21, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| TokenLearner: What Can 8 Learned Tokens Do for Images and Videos? | Jun 21, 2021 | Action ClassificationImage Classification | CodeCode Available | 1 |
| Proposal Relation Network for Temporal Action Detection | Jun 20, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action RecognitionAction Classification | CodeCode Available | 1 |
| Space-time Mixing Attention for Video Transformer | Jun 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Jun 9, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| CT-Net: Channel Tensorization Network for Video Classification | Jun 3, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action ClassificationVideo Recognition | CodeCode Available | 1 |
| Distributed Learning with Strategic Users: A Repeated Game Approach | May 21, 2021 | Action Classification | —Unverified | 0 |
| VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living | May 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action ClassificationDynamic Time Warping | CodeCode Available | 1 |
| Unsupervised Visual Representation Learning by Tracking Patches in Video | May 6, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| VidTr: Video Transformer Without Convolutions | Apr 23, 2021 | Action ClassificationAction Recognition | —Unverified | 0 |
| Multiscale Vision Transformers | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Temporal Query Networks for Fine-grained Video Understanding | Apr 19, 2021 | Action ClassificationAction Recognition | —Unverified | 0 |
| Adaptive Intermediate Representations for Video Understanding | Apr 14, 2021 | Action ClassificationOptical Flow Estimation | —Unverified | 0 |
| Object Priors for Classifying and Localizing Unseen Actions | Apr 10, 2021 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning | Apr 6, 2021 | Action ClassificationAction Detection | —Unverified | 0 |
| TubeR: Tubelet Transformer for Video Action Detection | Apr 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |