| ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Aug 16, 2023 | Action ClassificationImage-text Retrieval | CodeCode Available | 1 |
| Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition | Aug 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Enriching Local and Global Contexts for Temporal Action Localization | Jul 27, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 |
| HierVL: Learning Hierarchical Video-Language Embeddings | Jan 5, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| ViViT: A Video Vision Transformer | Mar 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Just Add π! Pose Induced Video Transformers for Understanding Activities of Daily Living | Nov 30, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition | Aug 10, 2024 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment | Dec 6, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 |
| Learning Spatiotemporal Features via Video and Text Pair Discrimination | Jan 16, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MotionSqueeze: Neural Motion Feature Learning for Video Understanding | Jul 20, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Learning To Recognize Procedural Activities with Distant Supervision | Jan 26, 2022 | Action ClassificationLanguage Modelling | CodeCode Available | 1 |
| ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning | Jun 27, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Make Your Training Flexible: Towards Deployment-Efficient Video Models | Mar 18, 2025 | Action ClassificationZero-Shot Video Retrieval | CodeCode Available | 1 |
| Finding the Missing Data: A BERT-inspired Approach Against Package Loss in Wireless Sensing | Mar 19, 2024 | Action ClassificationDeep Learning | CodeCode Available | 1 |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Jul 23, 2020 | Action ClassificationKeyword Spotting | CodeCode Available | 1 |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Memory-augmented Dense Predictive Coding for Video Representation Learning | Aug 3, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos | Apr 12, 2018 | Action ClassificationAction Detection | CodeCode Available | 1 |
| End-to-End Fine-Grained Action Segmentation and Recognition Using Conditional Random Field Models and Discriminative Sparse Coding | Jan 29, 2018 | Action ClassificationAction Segmentation | —Unverified | 0 |
| Adaptive Intermediate Representations for Video Understanding | Apr 14, 2021 | Action ClassificationOptical Flow Estimation | —Unverified | 0 |
| Egocentric Audio-Visual Noise Suppression | Nov 7, 2022 | Action ClassificationEvent Detection | —Unverified | 0 |
| ActionBytes: Learning From Trimmed Videos to Localize Actions | Jun 1, 2020 | Action ClassificationAction Localization | —Unverified | 0 |
| IMUVIE: Pickup Timeline Action Localization via Motion Movies | Nov 19, 2024 | Action ClassificationAction Localization | —Unverified | 0 |
| Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification | Aug 31, 2016 | 3D ArchitectureAction Classification | —Unverified | 0 |
| Efficient Optimization for Average Precision SVM | Dec 1, 2014 | Action ClassificationGeneral Classification | —Unverified | 0 |