| YourSkatingCoach: A Figure Skating Video Benchmark for Fine-Grained Element Analysis | Oct 27, 2024 | Action Classification | —Unverified | 0 |
| Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning | Apr 6, 2021 | Action ClassificationAction Detection | —Unverified | 0 |
| Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark | Dec 16, 2021 | Action ClassificationAction Detection | CodeCode Available | 0 |
| VideoBERT: A Joint Model for Video and Language Representation Learning | Apr 3, 2019 | Action ClassificationGeneral Classification | CodeCode Available | 0 |
| Actor-identified Spatiotemporal Action Detection --- Detecting Who Is Doing What in Videos | Aug 27, 2022 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition | May 29, 2025 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition | Jul 1, 2017 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture | Nov 27, 2017 | Action ClassificationAttribute | CodeCode Available | 0 |
| Video Classification with Channel-Separated Convolutional Networks | Apr 4, 2019 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Hierarchical Graph-Convolutional Variational AutoEncoding for Generative Modelling of Human Motion | Nov 24, 2021 | Action ClassificationTrajectory Prediction | CodeCode Available | 0 |
| A Probabilistic Semi-Supervised Approach to Multi-Task Human Activity Modeling | Sep 24, 2018 | Action ClassificationGeneral Classification | CodeCode Available | 0 |
| Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection | Apr 3, 2017 | Action ClassificationAction Localization | CodeCode Available | 0 |
| VideoLSTM Convolves, Attends and Flows for Action Recognition | Jul 6, 2016 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Can x2vec Save Lives? Integrating Graph and Language Embeddings for Automatic Mental Health Classification | Jan 4, 2020 | Action ClassificationActivity Prediction | CodeCode Available | 0 |
| Support Vector Machines with Time Series Distance Kernels for Action Classification | Mar 7, 2016 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms | Feb 6, 2023 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Hierarchical Explanations for Video Action Recognition | Jan 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Synthetic Humans for Action Recognition from Unseen Viewpoints | Dec 9, 2019 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| TAda! Temporally-Adaptive Convolutions for Video Understanding | Oct 12, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning | Oct 14, 2020 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Jun 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN | Dec 10, 2019 | Action AnticipationAction Classification | CodeCode Available | 0 |
| Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs | Jan 9, 2016 | Action ClassificationAction Localization | CodeCode Available | 0 |
| HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization | Dec 26, 2017 | Action ClassificationAction Localization | CodeCode Available | 0 |
| A^2-Nets: Double Attention Networks | Dec 1, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Graph Distillation for Action Detection with Privileged Modalities | Nov 30, 2017 | Action ClassificationAction Detection | CodeCode Available | 0 |
| AssembleNet++: Assembling Modality Representations via Attention Connections | Aug 18, 2020 | Action ClassificationActivity Recognition | CodeCode Available | 0 |
| temporal driver action Localization using action classifications method | Jun 11, 2022 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Audiovisual SlowFast Networks for Video Recognition | Jan 23, 2020 | Action ClassificationVideo Recognition | CodeCode Available | 0 |
| Video Transformer Network | Feb 1, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| A Short Note about Kinetics-600 | Aug 3, 2018 | Action Classification | CodeCode Available | 0 |
| Temporal Relational Reasoning in Videos | Nov 22, 2017 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Graph Convolutional Networks for Temporal Action Localization | Sep 7, 2019 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Graph-Based Global Reasoning Networks | Nov 30, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Global Semantic Descriptors for Zero-Shot Action Recognition | Sep 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions | Nov 24, 2024 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| 3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization | Aug 22, 2019 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors | Nov 17, 2024 | Action ClassificationSegmentation | CodeCode Available | 0 |
| From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos | Jun 5, 2025 | Action ClassificationComposed Video Retrieval (CoVR) | CodeCode Available | 0 |
| Pose And Joint-Aware Action Recognition | Oct 16, 2020 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks | Feb 6, 2023 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Object Priors for Classifying and Localizing Unseen Actions | Apr 10, 2021 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Attention Bottlenecks for Multimodal Fusion | Jun 30, 2021 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| What makes ImageNet good for transfer learning? | Aug 30, 2016 | Action ClassificationGeneral Classification | CodeCode Available | 0 |
| NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis | Apr 11, 2016 | 3D Action RecognitionAction Classification | CodeCode Available | 0 |
| NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels | Oct 13, 2021 | Action ClassificationSelf-Supervised Learning | CodeCode Available | 0 |
| Progression-Guided Temporal Action Detection in Videos | Aug 18, 2023 | Action ClassificationAction Detection | CodeCode Available | 0 |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Nov 14, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Multilevel semantic and adaptive actionness learning for weakly supervised temporal action localization | Nov 24, 2024 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Do we really need temporal convolutions in action segmentation? | May 26, 2022 | Action ClassificationAction Segmentation | CodeCode Available | 0 |