| An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 | 5 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition | Nov 30, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition | Jan 20, 2022 | Action AnticipationAction Classification | CodeCode Available | 1 | 5 |
| A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector | Jun 7, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 | 5 |
| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Jun 9, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders | Nov 16, 2022 | Action ClassificationRepresentation Learning | CodeCode Available | 1 | 5 |
| Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition | Nov 8, 2024 | Action ClassificationActivity Recognition | CodeCode Available | 1 | 5 |
| AViD Dataset: Anonymized Videos from Diverse Countries | Jul 10, 2020 | Action ClassificationAction Detection | CodeCode Available | 1 | 5 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action RecognitionAction Classification | CodeCode Available | 1 | 5 |
| Weakly-supervised Temporal Action Localization by Uncertainty Modeling | Jun 12, 2020 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action ClassificationDecoder | CodeCode Available | 1 | 5 |
| Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition | Aug 10, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Make Your Training Flexible: Towards Deployment-Efficient Video Models | Mar 18, 2025 | Action ClassificationZero-Shot Video Retrieval | CodeCode Available | 1 | 5 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Alleviating Over-segmentation Errors by Detecting Action Boundaries | Jul 14, 2020 | Action ClassificationAction Segmentation | CodeCode Available | 1 | 5 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 | 5 |
| CAST: Cross-Attention in Space and Time for Video Action Recognition | Nov 30, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Can Deep Learning Recognize Subtle Human Activities? | Mar 30, 2020 | Action ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network | Aug 20, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 | 5 |
| Large Scale Holistic Video Understanding | Apr 25, 2019 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Jul 23, 2020 | Action ClassificationKeyword Spotting | CodeCode Available | 1 | 5 |
| Implicit Temporal Modeling with Learnable Alignment for Video Recognition | Apr 20, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Aug 16, 2023 | Action ClassificationImage-text Retrieval | CodeCode Available | 1 | 5 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action ClassificationVideo Recognition | CodeCode Available | 1 | 5 |