| SkeleTR: Towards Skeleton-based Action Recognition in the Wild | Jan 1, 2023 | Action ClassificationAction Detection | —Unverified | 0 |
| Hierarchical Explanations for Video Action Recognition | Jan 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models | Dec 31, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Learning Video Representations from Large Language Models | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning | Dec 6, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| InternVideo: General Video Foundation Models via Generative and Discriminative Learning | Dec 6, 2022 | Action ClassificationAction Recognition | CodeCode Available | 4 |
| Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations | Dec 6, 2022 | Action ClassificationContrastive Learning | —Unverified | 0 |
| Spatio-Temporal Crop Aggregation for Video Representation Learning | Nov 30, 2022 | Action ClassificationDimensionality Reduction | —Unverified | 0 |
| Post-Processing Temporal Action Detection | Nov 27, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 |
| XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Nov 25, 2022 | Action ClassificationClassification | CodeCode Available | 1 |
| Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies | Nov 24, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| 3d human motion generation from the text via gesture action classification and the autoregressive model | Nov 18, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders | Nov 16, 2022 | Action ClassificationRepresentation Learning | CodeCode Available | 1 |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Nov 14, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| MARLIN: Masked Autoencoder for facial video Representation LearnINg | Nov 12, 2022 | Action ClassificationAttribute | CodeCode Available | 2 |
| Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks | Nov 11, 2022 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Egocentric Audio-Visual Noise Suppression | Nov 7, 2022 | Action ClassificationEvent Detection | —Unverified | 0 |
| Adversarial Domain Adaptation for Action Recognition Around the Clock | Oct 25, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| Turbo Training with Token Dropout | Oct 10, 2022 | Action ClassificationClassification | —Unverified | 0 |
| Application-Driven AI Paradigm for Human Action Recognition | Sep 30, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow | Sep 28, 2022 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Self-supervised Learning for Unintentional Action Prediction | Sep 24, 2022 | Action ClassificationPrediction | —Unverified | 0 |
| Global Semantic Descriptors for Zero-Shot Action Recognition | Sep 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer | Sep 22, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| OmniVL:One Foundation Model for Image-Language and Video-Language Tasks | Sep 15, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton based Action Recognition | Sep 7, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting | Aug 31, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Actor-identified Spatiotemporal Action Detection --- Detecting Who Is Doing What in Videos | Aug 27, 2022 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Temporal Action Localization with Multi-temporal Scales | Aug 16, 2022 | Action ClassificationAction Localization | —Unverified | 0 |
| Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition | Aug 12, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action ClassificationDecoder | CodeCode Available | 1 |
| Expanding Language-Image Pretrained Models for General Video Recognition | Aug 4, 2022 | Action ClassificationAction Recognition | CodeCode Available | 3 |
| Class-Difficulty Based Methods for Long-Tailed Visual Recognition | Jul 29, 2022 | Action Classificationimage-classification | CodeCode Available | 1 |
| Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition | Jul 27, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MAR: Masked Autoencoders for Efficient Action Recognition | Jul 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Is an Object-Centric Video Representation Beneficial for Transfer? | Jul 20, 2022 | Action ClassificationObject | —Unverified | 0 |
| ReAct: Temporal Action Detection with Relational Queries | Jul 14, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 |
| Revisiting Classifier: Transferring Vision-Language Models for Video Recognition | Jul 4, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning | Jun 27, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos | Jun 25, 2022 | Action ClassificationClustering | CodeCode Available | 1 |
| Context-aware Proposal Network for Temporal Action Detection | Jun 18, 2022 | Action ClassificationAction Detection | —Unverified | 0 |
| Stand-Alone Inter-Frame Attention in Video Models | Jun 14, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing | Jun 13, 2022 | 3D ArchitectureAction Classification | CodeCode Available | 0 |
| temporal driver action Localization using action classifications method | Jun 11, 2022 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Spatial-temporal Concept based Explanation of 3D ConvNets | Jun 9, 2022 | Action ClassificationVideo Recognition | CodeCode Available | 0 |
| A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector | Jun 7, 2022 | Action ClassificationAction Detection | CodeCode Available | 1 |
| MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos | May 26, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Do we really need temporal convolutions in action segmentation? | May 26, 2022 | Action ClassificationAction Segmentation | CodeCode Available | 0 |
| Handcrafted localized phase features for human action recognition | May 5, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |