| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Jun 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Self-Supervised Video Representation Learning via Latent Time Navigation | May 10, 2023 | Action ClassificationAction Recognition | —Unverified | 0 |
| VicTR: Video-conditioned Text Representations for Activity Recognition | Apr 5, 2023 | Action ClassificationActivity Recognition | —Unverified | 0 |
| Unmasked Teacher: Towards Training-Efficient Video Foundation Models | Mar 28, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | Mar 21, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Multi-modal Prompting for Low-Shot Temporal Action Localization | Mar 21, 2023 | Action ClassificationAction Localization | —Unverified | 0 |
| Classification of Primitive Manufacturing Tasks from Filtered Event Data | Mar 15, 2023 | Action ClassificationClassification | —Unverified | 0 |
| Scaling Vision Transformers to 22 Billion Parameters | Feb 10, 2023 | Action ClassificationFairness | CodeCode Available | 0 |
| Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms | Feb 6, 2023 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks | Feb 6, 2023 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Deep Dependency Networks for Multi-Label Classification | Feb 1, 2023 | Action ClassificationClassification | —Unverified | 0 |
| Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework | Jan 10, 2023 | Action ClassificationDecision Making | —Unverified | 0 |
| ReGen: A good Generative Zero-Shot Video Classifier Should be Rewarded | Jan 1, 2023 | Action ClassificationAction Recognition | —Unverified | 0 |
| SkeleTR: Towards Skeleton-based Action Recognition in the Wild | Jan 1, 2023 | Action ClassificationAction Detection | —Unverified | 0 |
| Hierarchical Explanations for Video Action Recognition | Jan 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations | Dec 6, 2022 | Action ClassificationContrastive Learning | —Unverified | 0 |
| Spatio-Temporal Crop Aggregation for Video Representation Learning | Nov 30, 2022 | Action ClassificationDimensionality Reduction | —Unverified | 0 |
| Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies | Nov 24, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| 3d human motion generation from the text via gesture action classification and the autoregressive model | Nov 18, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Nov 14, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks | Nov 11, 2022 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Egocentric Audio-Visual Noise Suppression | Nov 7, 2022 | Action ClassificationEvent Detection | —Unverified | 0 |
| Adversarial Domain Adaptation for Action Recognition Around the Clock | Oct 25, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| Turbo Training with Token Dropout | Oct 10, 2022 | Action ClassificationClassification | —Unverified | 0 |
| Application-Driven AI Paradigm for Human Action Recognition | Sep 30, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow | Sep 28, 2022 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Global Semantic Descriptors for Zero-Shot Action Recognition | Sep 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Self-supervised Learning for Unintentional Action Prediction | Sep 24, 2022 | Action ClassificationPrediction | —Unverified | 0 |
| OmniVL:One Foundation Model for Image-Language and Video-Language Tasks | Sep 15, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton based Action Recognition | Sep 7, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Actor-identified Spatiotemporal Action Detection --- Detecting Who Is Doing What in Videos | Aug 27, 2022 | Action ClassificationAction Detection | CodeCode Available | 0 |
| Temporal Action Localization with Multi-temporal Scales | Aug 16, 2022 | Action ClassificationAction Localization | —Unverified | 0 |
| Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition | Aug 12, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Is an Object-Centric Video Representation Beneficial for Transfer? | Jul 20, 2022 | Action ClassificationObject | —Unverified | 0 |
| Context-aware Proposal Network for Temporal Action Detection | Jun 18, 2022 | Action ClassificationAction Detection | —Unverified | 0 |
| MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing | Jun 13, 2022 | 3D ArchitectureAction Classification | CodeCode Available | 0 |
| temporal driver action Localization using action classifications method | Jun 11, 2022 | Action ClassificationAction Localization | CodeCode Available | 0 |
| Spatial-temporal Concept based Explanation of 3D ConvNets | Jun 9, 2022 | Action ClassificationVideo Recognition | CodeCode Available | 0 |
| Do we really need temporal convolutions in action segmentation? | May 26, 2022 | Action ClassificationAction Segmentation | CodeCode Available | 0 |
| Handcrafted localized phase features for human action recognition | May 5, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| Machine Learning and Signal Processing Based Analysis of sEMG Signals for Daily Action Classification | Apr 12, 2022 | Action Classification | —Unverified | 0 |
| Deformable Video Transformer | Mar 31, 2022 | Action Classification | —Unverified | 0 |
| Point3D: tracking actions as moving points with 3D CNNs | Mar 20, 2022 | Action ClassificationAction Localization | —Unverified | 0 |
| Know your sensORs -- A Modality Study For Surgical Action Classification | Mar 16, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework | Mar 8, 2022 | 3D Human Pose EstimationAction Classification | CodeCode Available | 0 |
| Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | Feb 16, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| End-to-end Generative Pretraining for Multimodal Video Captioning | Jan 20, 2022 | Action ClassificationDecoder | —Unverified | 0 |
| Video Transformers: A Survey | Jan 16, 2022 | Action ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Multiview Transformers for Video Recognition | Jan 12, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound | Jan 7, 2022 | Action ClassificationNavigate | —Unverified | 0 |