SOTAVerified

Action Classification

Papers

Showing 101150 of 457 papers

TitleStatusHype
TubeR: Tubelet Transformer for Video Action DetectionCode1
Busy-Quiet Video Disentangling for Video ClassificationCode1
ViViT: A Video Vision TransformerCode1
An Image is Worth 16x16 Words, What is a Video Worth?Code1
MoViNets: Mobile Video Networks for Efficient Video RecognitionCode1
Revisiting ResNets: Improved Training and Scaling StrategiesCode1
TCLR: Temporal Contrastive Learning for Video RepresentationCode1
TDN: Temporal Difference Networks for Efficient Action RecognitionCode1
MVFNet: Multi-View Fusion Network for Efficient Video RecognitionCode1
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization TasksCode1
Boundary-sensitive Pre-training for Temporal Localization in VideosCode1
Mutual Modality Learning for Video Action ClassificationCode1
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video ProcessingCode1
Memory-augmented Dense Predictive Coding for Video Representation LearningCode1
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cuesCode1
MotionSqueeze: Neural Motion Feature Learning for Video UnderstandingCode1
Region-based Non-local Operation for Video ClassificationCode1
Alleviating Over-segmentation Errors by Detecting Action BoundariesCode1
AViD Dataset: Anonymized Videos from Diverse CountriesCode1
VPN: Learning Video-Pose Embedding for Activities of Daily LivingCode1
Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual RecognitionCode1
Weakly-supervised Temporal Action Localization by Uncertainty ModelingCode1
Can Deep Learning Recognize Subtle Human Activities?Code1
Latent Embedding Feedback and Discriminative Features for Zero-Shot ClassificationCode1
Infrared and 3D skeleton feature fusion for RGB-D action recognitionCode1
Over-the-Air Adversarial Flickering Attacks against Video Recognition NetworksCode1
Learning Spatiotemporal Features via Video and Text Pair DiscriminationCode1
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods ComparisonCode1
An Evaluation of Action Recognition Models on EPIC-KitchensCode1
Large Scale Holistic Video UnderstandingCode1
What and How Well You Performed? A Multitask Learning Approach to Action Quality AssessmentCode1
High Quality Monocular Depth Estimation via Transfer LearningCode1
SlowFast Networks for Video RecognitionCode1
Timeception for Complex Action RecognitionCode1
TSM: Temporal Shift Module for Efficient Video UnderstandingCode1
SoccerNet: A Scalable Dataset for Action Spotting in Soccer VideosCode1
A Closer Look at Spatiotemporal Convolutions for Action RecognitionCode1
Non-local Neural NetworksCode1
ConvNet Architecture Search for Spatiotemporal Feature LearningCode1
Quo Vadis, Action Recognition? A New Model and the Kinetics DatasetCode1
The Kinetics Human Action Video DatasetCode1
Skeleton-based Action Recognition with Convolutional Neural NetworksCode1
Visual Semantic Role LabelingCode1
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis0
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained VideosCode0
Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action RecognitionCode0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes0
Domain Adaptation of VLM for Soccer Video Understanding0
OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition0
Show:102550
← PrevPage 3 of 10Next →

No leaderboard results yet.