| Masked Feature Prediction for Self-Supervised Visual Pre-Training | Dec 16, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Self-supervised Video Transformer | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classificationimage-classification | CodeCode Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis | Sep 29, 2021 | Action ClassificationClassification | CodeCode Available | 1 |
| Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Sep 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |