| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis | Sep 29, 2021 | Action ClassificationClassification | CodeCode Available | 1 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| roadscene2vec: A Tool for Extracting and Embedding Road Scene-Graphs | Sep 2, 2021 | Action ClassificationGraph Embedding | CodeCode Available | 1 |
| Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition | Aug 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Video Contrastive Learning with Global Context | Aug 5, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Enriching Local and Global Contexts for Temporal Action Localization | Jul 27, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |
| UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition | Jul 19, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games | Jul 12, 2021 | Action ClassificationActivity Recognition | CodeCode Available | 1 |
| VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning | Jun 21, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| TokenLearner: What Can 8 Learned Tokens Do for Images and Videos? | Jun 21, 2021 | Action ClassificationImage Classification | CodeCode Available | 1 |
| Proposal Relation Network for Temporal Action Detection | Jun 20, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action RecognitionAction Classification | CodeCode Available | 1 |
| Space-time Mixing Attention for Video Transformer | Jun 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Jun 9, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| CT-Net: Channel Tensorization Network for Video Classification | Jun 3, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action ClassificationVideo Recognition | CodeCode Available | 1 |
| VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living | May 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action ClassificationDynamic Time Warping | CodeCode Available | 1 |
| Unsupervised Visual Representation Learning by Tracking Patches in Video | May 6, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Multiscale Vision Transformers | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |