| Hierarchical Graph-Convolutional Variational AutoEncoding for Generative Modelling of Human Motion | Nov 24, 2021 | Action Classification, Trajectory Prediction | Code Available | 0 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classification, image-classification | Code Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action Classification, Object | Code Available | 1 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action Classification, Action Detection | Code Available | 1 |
| MetaVD: A Meta Video Dataset for enhancing human action recognition datasets | Nov 1, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels | Oct 13, 2021 | Action Classification, Self-Supervised Learning | Code Available | 0 |
| TAda! Temporally-Adaptive Convolutions for Video Understanding | Oct 12, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Sep 29, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Sep 29, 2021 | Action Classification, Action Recognition | Code Available | 1 |