SOTAVerified

Action Classification

Papers

Showing 101125 of 457 papers

TitleStatusHype
Latent Embedding Feedback and Discriminative Features for Zero-Shot ClassificationCode1
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action VideosCode1
AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose EstimationCode1
DirecFormer: A Directed Attention in Transformer Approach to Robust Action RecognitionCode1
A Closer Look at Spatiotemporal Convolutions for Action RecognitionCode1
Learning Spatiotemporal Features via Video and Text Pair DiscriminationCode1
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video ProcessingCode1
Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video GamesCode1
Stand-Alone Inter-Frame Attention in Video ModelsCode1
MoViNets: Mobile Video Networks for Efficient Video RecognitionCode1
Boundary-sensitive Pre-training for Temporal Localization in VideosCode1
Florence: A New Foundation Model for Computer VisionCode1
Enriching Local and Global Contexts for Temporal Action LocalizationCode1
Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence ClassificationCode1
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked AutoencodersCode1
Memory-augmented Dense Predictive Coding for Video Representation LearningCode1
Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity RecognitionCode1
Dual-path Adaptation from Image to Video TransformersCode1
AViD Dataset: Anonymized Videos from Diverse CountriesCode1
MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D VideosCode1
BABEL: Bodies, Action and Behavior with English LabelsCode1
A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action DetectorCode1
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action RecognitionCode1
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action UnderstandingCode1
ViViT: A Video Vision TransformerCode1
Show:102550
← PrevPage 5 of 19Next →

No leaderboard results yet.