| ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting | Aug 31, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action Classification, Decoder | Code Available | 1 |
| Class-Difficulty Based Methods for Long-Tailed Visual Recognition | Jul 29, 2022 | Action Classification, image-classification | Code Available | 1 |
| Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition | Jul 27, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| MAR: Masked Autoencoders for Efficient Action Recognition | Jul 24, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| ReAct: Temporal Action Detection with Relational Queries | Jul 14, 2022 | Action Classification, Action Detection | Code Available | 1 |
| ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning | Jun 27, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos | Jun 25, 2022 | Action Classification, Clustering | Code Available | 1 |
| Stand-Alone Inter-Frame Attention in Video Models | Jun 14, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector | Jun 7, 2022 | Action Classification, Action Detection | Code Available | 1 |
| MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos | May 26, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| CoCa: Contrastive Captioners are Image-Text Foundation Models | May 4, 2022 | Action Classification, Decoder | Code Available | 1 |
| An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action Classification, Action Detection | Code Available | 1 |
| SPAct: Self-supervised Privacy Preservation for Action Recognition | Mar 29, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Mar 28, 2022 | Action Classification, Contrastive Learning | Code Available | 1 |
| DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Mar 19, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| OpenTAL: Towards Open Set Temporal Action Localization | Mar 10, 2022 | Action Classification, Action Localization | Code Available | 1 |
| Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions | Feb 23, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| Learning To Recognize Procedural Activities with Distant Supervision | Jan 26, 2022 | Action Classification, Language Modelling | Code Available | 1 |
| MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition | Jan 20, 2022 | Action Anticipation, Action Classification | Code Available | 1 |
| Masked Feature Prediction for Self-Supervised Visual Pre-Training | Dec 16, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Self-supervised Video Transformer | Dec 2, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classification, image-classification | Code Available | 1 |