| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Apr 24, 2023 | 3D Hand Pose EstimationAction Classification | CodeCode Available | 1 | 5 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition | Nov 30, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Multiscale Vision Transformers | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action ClassificationDynamic Time Warping | CodeCode Available | 1 | 5 |
| MotionSqueeze: Neural Motion Feature Learning for Video Understanding | Jul 20, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Just Add π! Pose Induced Video Transformers for Understanding Activities of Daily Living | Nov 30, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders | Nov 16, 2022 | Action ClassificationRepresentation Learning | CodeCode Available | 1 | 5 |
| Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition | Nov 8, 2024 | Action ClassificationActivity Recognition | CodeCode Available | 1 | 5 |
| AViD Dataset: Anonymized Videos from Diverse Countries | Jul 10, 2020 | Action ClassificationAction Detection | CodeCode Available | 1 | 5 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action RecognitionAction Classification | CodeCode Available | 1 | 5 |
| Weakly-supervised Temporal Action Localization by Uncertainty Modeling | Jun 12, 2020 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment | Dec 6, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 | 5 |
| MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos | May 26, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| MoViNets: Mobile Video Networks for Efficient Video Recognition | Mar 21, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Mutual Modality Learning for Video Action Classification | Nov 4, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Aug 12, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition | Aug 10, 2024 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Jun 13, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| Memory-augmented Dense Predictive Coding for Video Representation Learning | Aug 3, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Aug 16, 2023 | Action ClassificationImage-text Retrieval | CodeCode Available | 1 | 5 |
| CAST: Cross-Attention in Space and Time for Video Action Recognition | Nov 30, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Masked Feature Prediction for Self-Supervised Visual Pre-Training | Dec 16, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Can Deep Learning Recognize Subtle Human Activities? | Mar 30, 2020 | Action ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| MAR: Masked Autoencoders for Efficient Action Recognition | Jul 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Class-Difficulty Based Methods for Long-Tailed Visual Recognition | Jul 29, 2022 | Action Classificationimage-classification | CodeCode Available | 1 | 5 |
| Alleviating Over-segmentation Errors by Detecting Action Boundaries | Jul 14, 2020 | Action ClassificationAction Segmentation | CodeCode Available | 1 | 5 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Enriching Local and Global Contexts for Temporal Action Localization | Jul 27, 2021 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 | 5 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Mar 28, 2022 | Action ClassificationContrastive Learning | CodeCode Available | 1 | 5 |
| CoCa: Contrastive Captioners are Image-Text Foundation Models | May 4, 2022 | Action ClassificationDecoder | CodeCode Available | 1 | 5 |
| An Evaluation of Action Recognition Models on EPIC-Kitchens | Aug 2, 2019 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Finding the Missing Data: A BERT-inspired Approach Against Package Loss in Wireless Sensing | Mar 19, 2024 | Action ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action ClassificationVideo Recognition | CodeCode Available | 1 | 5 |
| MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition | Jan 20, 2022 | Action AnticipationAction Classification | CodeCode Available | 1 | 5 |
| ConvNet Architecture Search for Spatiotemporal Feature Learning | Aug 16, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Dual-path Adaptation from Image to Video Transformers | Mar 17, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action ClassificationDecoder | CodeCode Available | 1 | 5 |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Jul 23, 2020 | Action ClassificationKeyword Spotting | CodeCode Available | 1 | 5 |
| CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network | Aug 20, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 | 5 |
| CT-Net: Channel Tensorization Network for Video Classification | Jun 3, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Open-Vocabulary Video Relation Extraction | Dec 25, 2023 | Action ClassificationAction Understanding | CodeCode Available | 1 | 5 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games | Jul 12, 2021 | Action ClassificationActivity Recognition | CodeCode Available | 1 | 5 |
| HierVL: Learning Hierarchical Video-Language Embeddings | Jan 5, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| High Quality Monocular Depth Estimation via Transfer Learning | Dec 31, 2018 | Action ClassificationDecoder | CodeCode Available | 1 | 5 |
| Implicit Temporal Modeling with Learnable Alignment for Video Recognition | Apr 20, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Learning To Recognize Procedural Activities with Distant Supervision | Jan 26, 2022 | Action ClassificationLanguage Modelling | CodeCode Available | 1 | 5 |