SOTAVerified

Action Classification

Papers

Showing 201–250 of 457 papers

Title | Status | Hype
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Code | 0
Self-Supervised Video Representation Learning via Latent Time Navigation | - | 0
VicTR: Video-conditioned Text Representations for Activity Recognition | - | 0
Unmasked Teacher: Towards Training-Efficient Video Foundation Models | Code | 0
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | Code | 0
Multi-modal Prompting for Low-Shot Temporal Action Localization | - | 0
Classification of Primitive Manufacturing Tasks from Filtered Event Data | - | 0
Scaling Vision Transformers to 22 Billion Parameters | Code | 0
Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms | Code | 0
Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks | Code | 0
Deep Dependency Networks for Multi-Label Classification | - | 0
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework | - | 0
ReGen: A good Generative Zero-Shot Video Classifier Should be Rewarded | - | 0
SkeleTR: Towards Skeleton-based Action Recognition in the Wild | - | 0
Hierarchical Explanations for Video Action Recognition | Code | 0
Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations | - | 0
Spatio-Temporal Crop Aggregation for Video Representation Learning | - | 0
Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies | - | 0
3d human motion generation from the text via gesture action classification and the autoregressive model | - | 0
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Code | 0
Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks | Code | 0
Egocentric Audio-Visual Noise Suppression | - | 0
Adversarial Domain Adaptation for Action Recognition Around the Clock | - | 0
Turbo Training with Token Dropout | - | 0
Application-Driven AI Paradigm for Human Action Recognition | - | 0
RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow | Code | 0
Global Semantic Descriptors for Zero-Shot Action Recognition | Code | 0
Self-supervised Learning for Unintentional Action Prediction | - | 0
OmniVL:One Foundation Model for Image-Language and Video-Language Tasks | - | 0
Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton based Action Recognition | Code | 0
Actor-identified Spatiotemporal Action Detection --- Detecting Who Is Doing What in Videos | Code | 0
Temporal Action Localization with Multi-temporal Scales | - | 0
Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition | Code | 0
Is an Object-Centric Video Representation Beneficial for Transfer? | - | 0
Context-aware Proposal Network for Temporal Action Detection | - | 0
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing | Code | 0
temporal driver action Localization using action classifications method | Code | 0
Spatial-temporal Concept based Explanation of 3D ConvNets | Code | 0
Do we really need temporal convolutions in action segmentation? | Code | 0
Handcrafted localized phase features for human action recognition | - | 0
Machine Learning and Signal Processing Based Analysis of sEMG Signals for Daily Action Classification | - | 0
Deformable Video Transformer | - | 0
Point3D: tracking actions as moving points with 3D CNNs | - | 0
Know your sensORs -- A Modality Study For Surgical Action Classification | - | 0
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework | Code | 0
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | Code | 0
End-to-end Generative Pretraining for Multimodal Video Captioning | - | 0
Video Transformers: A Survey | - | 0
Multiview Transformers for Video Recognition | Code | 0
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound | - | 0
Page 5 of 10

No leaderboard results yet.