| CoCa: Contrastive Captioners are Image-Text Foundation Models | May 4, 2022 | Action ClassificationDecoder | CodeCode Available | 1 |
| Machine Learning and Signal Processing Based Analysis of sEMG Signals for Daily Action Classification | Apr 12, 2022 | Action Classification | —Unverified | 0 |
| An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Deformable Video Transformer | Mar 31, 2022 | Action Classification | —Unverified | 0 |
| SPAct: Self-supervised Privacy Preservation for Action Recognition | Mar 29, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Mar 28, 2022 | Action ClassificationContrastive Learning | CodeCode Available | 1 |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 |
| Point3D: tracking actions as moving points with 3D CNNs | Mar 20, 2022 | Action ClassificationAction Localization | —Unverified | 0 |
| DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Mar 19, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Know your sensORs -- A Modality Study For Surgical Action Classification | Mar 16, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| OpenTAL: Towards Open Set Temporal Action Localization | Mar 10, 2022 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework | Mar 8, 2022 | 3D Human Pose EstimationAction Classification | CodeCode Available | 0 |
| Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions | Feb 23, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | Feb 16, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Learning To Recognize Procedural Activities with Distant Supervision | Jan 26, 2022 | Action ClassificationLanguage Modelling | CodeCode Available | 1 |
| MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition | Jan 20, 2022 | Action AnticipationAction Classification | CodeCode Available | 1 |
| Omnivore: A Single Model for Many Visual Modalities | Jan 20, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| End-to-end Generative Pretraining for Multimodal Video Captioning | Jan 20, 2022 | Action ClassificationDecoder | —Unverified | 0 |
| Video Transformers: A Survey | Jan 16, 2022 | Action ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Multiview Transformers for Video Recognition | Jan 12, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound | Jan 7, 2022 | Action ClassificationNavigate | —Unverified | 0 |
| Improving Video Model Transfer With Dynamic Representation Learning | Jan 1, 2022 | Action ClassificationKnowledge Distillation | —Unverified | 0 |
| Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark | Dec 16, 2021 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Masked Feature Prediction for Self-Supervised Visual Pre-Training | Dec 16, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Co-training Transformer with Videos and Images Improves Action Recognition | Dec 14, 2021 | Action ClassificationAction Recognition | —Unverified | 0 |