| Unsupervised Video Understanding by Reconciliation of Posture Similarities | Aug 3, 2017 | Action ClassificationRetrieval | —Unverified | 0 | 0 |
| Multi-Fiber Networks for Video Recognition | Jul 30, 2018 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| Unsupervised View-Invariant Human Posture Representation | Sep 17, 2021 | 3D Action Recognition3D Pose Estimation | —Unverified | 0 | 0 |
| Multi-Level Sequence GAN for Group Activity Recognition | Dec 18, 2018 | Action ClassificationActivity Prediction | —Unverified | 0 | 0 |
| Multi-modal Prompting for Low-Shot Temporal Action Localization | Mar 21, 2023 | Action ClassificationAction Localization | —Unverified | 0 | 0 |
| Multi-Modal Three-Stream Network for Action Recognition | Sep 8, 2019 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network | Jan 8, 2019 | Action ClassificationLanguage Modelling | —Unverified | 0 | 0 |
| Multiview Transformers for Video Recognition | Jan 12, 2022 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| FASTER Recurrent Networks for Efficient Video Classification | Jun 10, 2019 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| FACTS: Fine-Grained Action Classification for Tactical Sports | Dec 21, 2024 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis | Aug 1, 2016 | Action ClassificationObject Recognition | —Unverified | 0 | 0 |
| Representation Learning on Visual-Symbolic Graphs for Video Understanding | May 17, 2019 | Action ClassificationAction Detection | —Unverified | 0 | 0 |
| Actor-Centric Relation Network | Jul 28, 2018 | Action ClassificationAction Detection | —Unverified | 0 | 0 |
| No More Shortcuts: Realizing the Potential of Temporal Self-Supervision | Dec 20, 2023 | Action ClassificationAttribute | —Unverified | 0 | 0 |
| Evolving Space-Time Neural Architectures for Videos | Nov 26, 2018 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| Activity Driven Weakly Supervised Object Detection | Apr 2, 2019 | Action ClassificationObject | —Unverified | 0 | 0 |
| ActionVLAD: Learning spatio-temporal aggregation for action classification | Apr 10, 2017 | Action ClassificationClassification | —Unverified | 0 | 0 |
| VicTR: Video-conditioned Text Representations for Activity Recognition | Apr 5, 2023 | Action ClassificationActivity Recognition | —Unverified | 0 | 0 |
| Ensembles of Deep Neural Networks for Action Recognition in Still Images | Mar 22, 2020 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| OmniVec2 - A Novel Transformer based Network for Large Scale Multimodal and Multitask Learning | Jan 1, 2024 | 3D Point Cloud ClassificationAction Classification | —Unverified | 0 | 0 |
| OmniVec: Learning robust representations with cross modal sharing | Nov 7, 2023 | 3D Point Cloud ClassificationAction Classification | —Unverified | 0 | 0 |
| OmniVL:One Foundation Model for Image-Language and Video-Language Tasks | Sep 15, 2022 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| Enhancing Video Transformers for Action Understanding with VLM-aided Training | Mar 24, 2024 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| End-to-end Generative Pretraining for Multimodal Video Captioning | Jan 20, 2022 | Action ClassificationDecoder | —Unverified | 0 | 0 |
| End-to-End Fine-Grained Action Segmentation and Recognition Using Conditional Random Field Models and Discriminative Sparse Coding | Jan 29, 2018 | Action ClassificationAction Segmentation | —Unverified | 0 | 0 |