SOTAVerified

Action Classification

Papers

Showing 11–20 of 457 papers

Title | Status | Hype
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models | Code | 2
Learning Video Representations from Large Language Models | Code | 2
MARLIN: Masked Autoencoder for facial video Representation LearnINg | Code | 2
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer | Code | 2
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition | Code | 2
Omnivore: A Single Model for Many Visual Modalities | Code | 2
Video Swin Transformer | Code | 2
Is Space-Time Attention All You Need for Video Understanding? | Code | 2
X3D: Expanding Architectures for Efficient Video Recognition | Code | 2
Omni-sourced Webly-supervised Learning for Video Recognition | Code | 2

No leaderboard results yet.