| Alleviating Over-segmentation Errors by Detecting Action Boundaries | Jul 14, 2020 | Action Classification, Action Segmentation | Code Available | 1 |
| DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Mar 19, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| Infrared and 3D skeleton feature fusion for RGB-D action recognition | Feb 28, 2020 | Action Classification, Action Recognition | Code Available | 1 |
| An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action Classification, Action Detection | Code Available | 1 |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Mar 17, 2020 | Action Classification, Classification | Code Available | 1 |
| CoCa: Contrastive Captioners are Image-Text Foundation Models | May 4, 2022 | Action Classification, Decoder | Code Available | 1 |
| An Evaluation of Action Recognition Models on EPIC-Kitchens | Aug 2, 2019 | Action Classification, Action Recognition | Code Available | 1 |
| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Jun 9, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| HierVL: Learning Hierarchical Video-Language Embeddings | Jan 5, 2023 | Action Classification, Action Recognition | Code Available | 1 |
| Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games | Jul 12, 2021 | Action Classification, Activity Recognition | Code Available | 1 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action Classification, Video Recognition | Code Available | 1 |
| Masked Feature Prediction for Self-Supervised Visual Pre-Training | Dec 16, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| ConvNet Architecture Search for Spatiotemporal Feature Learning | Aug 16, 2017 | Action Classification, Action Recognition | Code Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action Classification, Object | Code Available | 1 |
| Memory-augmented Dense Predictive Coding for Video Representation Learning | Aug 3, 2020 | Action Classification, Action Recognition | Code Available | 1 |
| MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos | May 26, 2022 | Action Classification, Action Recognition | Code Available | 1 |
| CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network | Aug 20, 2024 | Action Classification, Action Classification (1-shot) | Code Available | 1 |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Jul 23, 2020 | Action Classification, Keyword Spotting | Code Available | 1 |
| MoViNets: Mobile Video Networks for Efficient Video Recognition | Mar 21, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| High Quality Monocular Depth Estimation via Transfer Learning | Dec 31, 2018 | Action Classification, Decoder | Code Available | 1 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Mar 28, 2022 | Action Classification, Contrastive Learning | Code Available | 1 |
| Boundary-sensitive Pre-training for Temporal Localization in Videos | Nov 21, 2020 | Action Classification, Classification | Code Available | 1 |
| OpenTAL: Towards Open Set Temporal Action Localization | Mar 10, 2022 | Action Classification, Action Localization | Code Available | 1 |
| Dual-path Adaptation from Image to Video Transformers | Mar 17, 2023 | Action Classification, Action Recognition | Code Available | 1 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action Classification, Decoder | Code Available | 1 |