| Omnivore: A Single Model for Many Visual Modalities | Jan 20, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 | 5 |
| Learning Video Representations from Large Language Models | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 | 5 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action RecognitionAction Classification | CodeCode Available | 1 | 5 |
| Weakly-supervised Temporal Action Localization by Uncertainty Modeling | Jun 12, 2020 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Mar 19, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 | 5 |
| Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition | Nov 8, 2024 | Action ClassificationActivity Recognition | CodeCode Available | 1 | 5 |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders | Nov 16, 2022 | Action ClassificationRepresentation Learning | CodeCode Available | 1 | 5 |
| AViD Dataset: Anonymized Videos from Diverse Countries | Jul 10, 2020 | Action ClassificationAction Detection | CodeCode Available | 1 | 5 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition | Nov 30, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |