| CoCa: Contrastive Captioners are Image-Text Foundation Models | May 4, 2022 | Action ClassificationDecoder | CodeCode Available | 1 |
| Machine Learning and Signal Processing Based Analysis of sEMG Signals for Daily Action Classification | Apr 12, 2022 | Action Classification | —Unverified | 0 |
| An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Deformable Video Transformer | Mar 31, 2022 | Action Classification | —Unverified | 0 |
| SPAct: Self-supervised Privacy Preservation for Action Recognition | Mar 29, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Mar 28, 2022 | Action ClassificationContrastive Learning | CodeCode Available | 1 |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 |
| Point3D: tracking actions as moving points with 3D CNNs | Mar 20, 2022 | Action ClassificationAction Localization | —Unverified | 0 |
| DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Mar 19, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Know your sensORs -- A Modality Study For Surgical Action Classification | Mar 16, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| OpenTAL: Towards Open Set Temporal Action Localization | Mar 10, 2022 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework | Mar 8, 2022 | 3D Human Pose EstimationAction Classification | CodeCode Available | 0 |
| Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions | Feb 23, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | Feb 16, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Learning To Recognize Procedural Activities with Distant Supervision | Jan 26, 2022 | Action ClassificationLanguage Modelling | CodeCode Available | 1 |
| Omnivore: A Single Model for Many Visual Modalities | Jan 20, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition | Jan 20, 2022 | Action AnticipationAction Classification | CodeCode Available | 1 |
| End-to-end Generative Pretraining for Multimodal Video Captioning | Jan 20, 2022 | Action ClassificationDecoder | —Unverified | 0 |
| Video Transformers: A Survey | Jan 16, 2022 | Action ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Multiview Transformers for Video Recognition | Jan 12, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound | Jan 7, 2022 | Action ClassificationNavigate | —Unverified | 0 |
| Improving Video Model Transfer With Dynamic Representation Learning | Jan 1, 2022 | Action ClassificationKnowledge Distillation | —Unverified | 0 |
| Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark | Dec 16, 2021 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Masked Feature Prediction for Self-Supervised Visual Pre-Training | Dec 16, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Co-training Transformer with Videos and Images Improves Action Recognition | Dec 14, 2021 | Action ClassificationAction Recognition | —Unverified | 0 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Self-supervised Video Transformer | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| PreViTS: Contrastive Pretraining with Video Tracking Supervision | Dec 1, 2021 | Action ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Low-Fidelity Video Encoder Optimization for Temporal Action Localization | Dec 1, 2021 | Action ClassificationAction Localization | —Unverified | 0 |
| Reformulating Zero-shot Action Recognition for Multi-label Actions | Dec 1, 2021 | Action ClassificationAction Detection | —Unverified | 0 |
| Hierarchical Graph-Convolutional Variational AutoEncoding for Generative Modelling of Human Motion | Nov 24, 2021 | Action ClassificationTrajectory Prediction | CodeCode Available | 0 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classificationimage-classification | CodeCode Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| MetaVD: A Meta Video Dataset for enhancing human action recognition datasets | Nov 1, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels | Oct 13, 2021 | Action ClassificationSelf-Supervised Learning | CodeCode Available | 0 |
| TAda! Temporally-Adaptive Convolutions for Video Understanding | Oct 12, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis | Sep 29, 2021 | Action ClassificationClassification | CodeCode Available | 1 |
| Class incremental learning for video action classification | Sep 19, 2021 | Action ClassificationAction Recognition In Videos | —Unverified | 0 |
| Unsupervised View-Invariant Human Posture Representation | Sep 17, 2021 | 3D Action Recognition3D Pose Estimation | —Unverified | 0 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Revisiting 3D ResNets for Video Recognition | Sep 3, 2021 | Action ClassificationContrastive Learning | CodeCode Available | 0 |
| roadscene2vec: A Tool for Extracting and Embedding Road Scene-Graphs | Sep 2, 2021 | Action ClassificationGraph Embedding | CodeCode Available | 1 |
| Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition | Aug 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Pose is all you need: The pose only group activity recognition system (POGARS) | Aug 9, 2021 | Action ClassificationActivity Prediction | —Unverified | 0 |
| Video Contrastive Learning with Global Context | Aug 5, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Enriching Local and Global Contexts for Temporal Action Localization | Jul 27, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |