| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Self-supervised Video Transformer | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| PreViTS: Contrastive Pretraining with Video Tracking Supervision | Dec 1, 2021 | Action ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Low-Fidelity Video Encoder Optimization for Temporal Action Localization | Dec 1, 2021 | Action ClassificationAction Localization | —Unverified | 0 |
| Reformulating Zero-shot Action Recognition for Multi-label Actions | Dec 1, 2021 | Action ClassificationAction Detection | —Unverified | 0 |
| Hierarchical Graph-Convolutional Variational AutoEncoding for Generative Modelling of Human Motion | Nov 24, 2021 | Action ClassificationTrajectory Prediction | CodeCode Available | 0 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classificationimage-classification | CodeCode Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| MetaVD: A Meta Video Dataset for enhancing human action recognition datasets | Nov 1, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels | Oct 13, 2021 | Action ClassificationSelf-Supervised Learning | CodeCode Available | 0 |
| TAda! Temporally-Adaptive Convolutions for Video Understanding | Oct 12, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis | Sep 29, 2021 | Action ClassificationClassification | CodeCode Available | 1 |
| Class incremental learning for video action classification | Sep 19, 2021 | Action ClassificationAction Recognition In Videos | —Unverified | 0 |
| Unsupervised View-Invariant Human Posture Representation | Sep 17, 2021 | 3D Action Recognition3D Pose Estimation | —Unverified | 0 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Revisiting 3D ResNets for Video Recognition | Sep 3, 2021 | Action ClassificationContrastive Learning | CodeCode Available | 0 |
| roadscene2vec: A Tool for Extracting and Embedding Road Scene-Graphs | Sep 2, 2021 | Action ClassificationGraph Embedding | CodeCode Available | 1 |
| Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition | Aug 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Pose is all you need: The pose only group activity recognition system (POGARS) | Aug 9, 2021 | Action ClassificationActivity Prediction | —Unverified | 0 |
| Video Contrastive Learning with Global Context | Aug 5, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Enriching Local and Global Contexts for Temporal Action Localization | Jul 27, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |