| How Object Information Improves Skeleton-based Human Action Recognition in Assembly Tasks | Jun 9, 2023 | Action Classification; Action Recognition | Unverified | 0 |
| Human Action Recognition in Egocentric Perspective Using 2D Object and Hands Pose | Jun 8, 2023 | Action Classification; Action Recognition | Unverified | 0 |
| HomE: Homography-Equivariant Video Representation Learning | Jun 2, 2023 | Action Classification; Action Recognition | Code Available | 0 |
| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Jun 1, 2023 | Action Classification; Action Recognition | Code Available | 0 |
| ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | May 18, 2023 | 1 Image, 2*2 Stitchi; Action Classification | Code Available | 3 |
| Self-Supervised Video Representation Learning via Latent Time Navigation | May 10, 2023 | Action Classification; Action Recognition | Unverified | 0 |
| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Apr 24, 2023 | 3D Hand Pose Estimation; Action Classification | Code Available | 1 |
| Implicit Temporal Modeling with Learnable Alignment for Video Recognition | Apr 20, 2023 | Action Classification; Action Recognition | Code Available | 1 |
| VicTR: Video-conditioned Text Representations for Activity Recognition | Apr 5, 2023 | Action Classification; Activity Recognition | Unverified | 0 |
| VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking | Mar 29, 2023 | Action Classification; Action Recognition | Code Available | 2 |
| Unmasked Teacher: Towards Training-Efficient Video Foundation Models | Mar 28, 2023 | Action Classification; Action Recognition | Code Available | 0 |
| The effectiveness of MAE pre-pretraining for billion-scale pretraining | Mar 23, 2023 | Action Classification; Action Recognition | Code Available | 1 |
| Multi-modal Prompting for Low-Shot Temporal Action Localization | Mar 21, 2023 | Action Classification; Action Localization | Unverified | 0 |
| ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | Mar 21, 2023 | Action Classification; Action Recognition | Code Available | 0 |
| Dual-path Adaptation from Image to Video Transformers | Mar 17, 2023 | Action Classification; Action Recognition | Code Available | 1 |
| Classification of Primitive Manufacturing Tasks from Filtered Event Data | Mar 15, 2023 | Action Classification; Classification | Unverified | 0 |
| Scaling Vision Transformers to 22 Billion Parameters | Feb 10, 2023 | Action Classification; Fairness | Code Available | 0 |
| Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks | Feb 6, 2023 | Action Classification; Action Detection | Code Available | 0 |
| AIM: Adapting Image Models for Efficient Video Action Recognition | Feb 6, 2023 | Action Classification; Action Recognition | Code Available | 2 |
| Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms | Feb 6, 2023 | Action Classification; Action Detection | Code Available | 0 |
| Deep Dependency Networks for Multi-Label Classification | Feb 1, 2023 | Action Classification; Classification | Unverified | 0 |
| mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video | Feb 1, 2023 | Action Classification; Image Classification | Code Available | 4 |
| Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework | Jan 10, 2023 | Action Classification; Decision Making | Unverified | 0 |
| HierVL: Learning Hierarchical Video-Language Embeddings | Jan 5, 2023 | Action Classification; Action Recognition | Code Available | 1 |
| ReGen: A good Generative Zero-Shot Video Classifier Should be Rewarded | Jan 1, 2023 | Action Classification; Action Recognition | Unverified | 0 |