| UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition | Jul 19, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games | Jul 12, 2021 | Action Classification, Activity Recognition | Code Available | 1 |
| Attention Bottlenecks for Multimodal Fusion | Jun 30, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| Video Swin Transformer | Jun 24, 2021 | Action Classification, Action Recognition | Code Available | 2 |
| TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification | Jun 21, 2021 | Action Classification, Classification | Code Available | 0 |
| VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning | Jun 21, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| TokenLearner: What Can 8 Learned Tokens Do for Images and Videos? | Jun 21, 2021 | Action Classification, Image Classification | Code Available | 1 |
| Proposal Relation Network for Temporal Action Detection | Jun 20, 2021 | Action Classification, Action Detection | Code Available | 1 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action Recognition, Action Classification | Code Available | 1 |
| Space-time Mixing Attention for Video Transformer | Jun 10, 2021 | Action Classification, Action Recognition | Code Available | 1 |