SOTAVerified

Action Classification

Papers

Showing 401-450 of 457 papers

Title | Status | Hype
Multi class activity classification in videos using Motion History Image generation | Code | 0
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework | Code | 0
M-PACT: An Open Source Platform for Repeatable Activity Classification Research | Code | 0
RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow | Code | 0
TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification | Code | 0
Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition | Code | 0
ECO: Efficient Convolutional Network for Online Video Understanding | Code | 0
What Makes Training Multi-Modal Classification Networks Hard? | Code | 0
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution | Code | 0
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | Code | 0
Training Deep Neural Networks via Direct Loss Minimization | Code | 0
Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks | Code | 0
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation | Code | 0
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition | Code | 0
Representation Flow for Action Recognition | Code | 0
MOFO: MOtion FOcused Self-Supervision for Video Understanding | Code | 0
Domain and View-point Agnostic Hand Action Recognition | Code | 0
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification | Code | 0
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification | Code | 0
Asynchronous Temporal Fields for Action Recognition | Code | 0
Modality Distillation with Multiple Stream Networks for Action Recognition | Code | 0
Revisiting 3D ResNets for Video Recognition | Code | 0
Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition | Code | 0
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing | Code | 0
MetaVD: A Meta Video Dataset for enhancing human action recognition datasets | Code | 0
Two-Stream Convolutional Networks for Action Recognition in Videos | Code | 0
MARS: Motion-Augmented RGB Stream for Action Recognition | Code | 0
Asymmetric Masked Distillation for Pre-Training Small Foundation Models | Code | 0
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures | Code | 0
RWF-2000: An Open Large Scale Video Database for Violence Detection | Code | 0
Saliency Tubes: Visual Explanations for Spatio-Temporal Convolutions | Code | 0
Deep Concept-wise Temporal Convolutional Networks for Action Localization | Code | 0
Scaling Vision Transformers to 22 Billion Parameters | Code | 0
Long-Term Feature Banks for Detailed Video Understanding | Code | 0
D3D: Distilled 3D Networks for Video Action Recognition | Code | 0
Compressed Video Action Recognition | Code | 0
Unmasked Teacher: Towards Training-Efficient Video Foundation Models | Code | 0
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition | Code | 0
Appearance-and-Relation Networks for Video Classification | Code | 0
SoccerDB: A Large-Scale Database for Comprehensive Video Understanding | Code | 0
Learn to cycle: Time-consistent feature discovery for action recognition | Code | 0
Collaborative Spatiotemporal Feature Learning for Video Action Recognition | Code | 0
Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton based Action Recognition | Code | 0
Learning Spatio-Temporal Representation with Local and Global Diffusion | Code | 0
ClusterFit: Improving Generalization of Visual Representations | Code | 0
Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters | Code | 0
Weakly Supervised Action Localization by Sparse Temporal Pooling Network | Code | 0
Global Textual Relation Embedding for Relational Understanding | Code | 0
Learning Gating ConvNet for Two-Stream based Methods in Action Recognition | Code | 0
Large-scale weakly-supervised pre-training for video action recognition | Code | 0

No leaderboard results yet.