| Finding the Missing Data: A BERT-inspired Approach Against Package Loss in Wireless Sensing | Mar 19, 2024 | Action ClassificationDeep Learning | CodeCode Available | 1 |
| Region-based Non-local Operation for Video Classification | Jul 17, 2020 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Apr 24, 2023 | 3D Hand Pose EstimationAction Classification | CodeCode Available | 1 |
| DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Mar 19, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition | Nov 30, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing | Sep 30, 2020 | Action ClassificationVideo Recognition | CodeCode Available | 1 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning | Nov 27, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Skeleton-based Action Recognition with Convolutional Neural Networks | Apr 25, 2017 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Boundary-sensitive Pre-training for Temporal Localization in Videos | Nov 21, 2020 | Action ClassificationClassification | CodeCode Available | 1 |
| Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition | Jul 27, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action ClassificationDecoder | CodeCode Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classificationimage-classification | CodeCode Available | 1 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Dec 12, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 |
| HierVL: Learning Hierarchical Video-Language Embeddings | Jan 5, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Dual-path Adaptation from Image to Video Transformers | Mar 17, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| AViD Dataset: Anonymized Videos from Diverse Countries | Jul 10, 2020 | Action ClassificationAction Detection | CodeCode Available | 1 |
| The effectiveness of MAE pre-pretraining for billion-scale pretraining | Mar 23, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| BABEL: Bodies, Action and Behavior with English Labels | Jun 17, 2021 | 3D Action RecognitionAction Classification | CodeCode Available | 1 |
| Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis | Sep 29, 2021 | Action ClassificationClassification | CodeCode Available | 1 |
| High Quality Monocular Depth Estimation via Transfer Learning | Dec 31, 2018 | Action ClassificationDecoder | CodeCode Available | 1 |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Jun 13, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 |
| ViViT: A Video Vision Transformer | Mar 29, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |