SOTAVerified

Action Classification

Papers

Showing 151-200 of 457 papers

Title | Status | Hype
CoCa: Contrastive Captioners are Image-Text Foundation Models | Code | 1
Machine Learning and Signal Processing Based Analysis of sEMG Signals for Daily Action Classification | - | 0
An Empirical Study of End-to-End Temporal Action Detection | Code | 1
Deformable Video Transformer | - | 0
SPAct: Self-supervised Privacy Preservation for Action Recognition | Code | 1
Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Code | 1
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Code | 3
Point3D: tracking actions as moving points with 3D CNNs | - | 0
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition | Code | 1
Know your sensORs -- A Modality Study For Surgical Action Classification | - | 0
OpenTAL: Towards Open Set Temporal Action Localization | Code | 1
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework | Code | 0
Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions | Code | 1
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | Code | 0
Learning To Recognize Procedural Activities with Distant Supervision | Code | 1
Omnivore: A Single Model for Many Visual Modalities | Code | 2
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition | Code | 1
End-to-end Generative Pretraining for Multimodal Video Captioning | - | 0
Video Transformers: A Survey | - | 0
Multiview Transformers for Video Recognition | Code | 0
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound | - | 0
Improving Video Model Transfer With Dynamic Representation Learning | - | 0
Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark | Code | 0
Masked Feature Prediction for Self-Supervised Visual Pre-Training | Code | 1
Co-training Transformer with Videos and Images Improves Action Recognition | - | 0
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Code | 1
Self-supervised Video Transformer | Code | 1
PreViTS: Contrastive Pretraining with Video Tracking Supervision | - | 0
Low-Fidelity Video Encoder Optimization for Temporal Action Localization | - | 0
Reformulating Zero-shot Action Recognition for Multi-label Actions | - | 0
Hierarchical Graph-Convolutional Variational AutoEncoding for Generative Modelling of Human Motion | Code | 0
Florence: A New Foundation Model for Computer Vision | Code | 1
Swin Transformer V2: Scaling Up Capacity and Resolution | Code | 1
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Code | 1
Revisiting spatio-temporal layouts for compositional action recognition | Code | 1
MetaVD: A Meta Video Dataset for enhancing human action recognition datasets | Code | 0
NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels | Code | 0
TAda! Temporally-Adaptive Convolutions for Video Understanding | Code | 0
Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Code | 1
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Code | 1
Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis | Code | 1
Class incremental learning for video action classification | - | 0
Unsupervised View-Invariant Human Posture Representation | - | 0
ActionCLIP: A New Paradigm for Video Action Recognition | Code | 1
Revisiting 3D ResNets for Video Recognition | Code | 0
roadscene2vec: A Tool for Extracting and Embedding Road Scene-Graphs | Code | 1
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition | Code | 1
Pose is all you need: The pose only group activity recognition system (POGARS) | - | 0
Video Contrastive Learning with Global Context | Code | 1
Enriching Local and Global Contexts for Temporal Action Localization | Code | 1

No leaderboard results yet.