SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 341350 of 1149 papers

TitleStatusHype
Token Shift Transformer for Video ClassificationCode1
Elaborative Rehearsal for Zero-shot Action RecognitionCode1
Enhancing Self-supervised Video Representation Learning via Multi-level Feature OptimizationCode1
Spatial-Temporal Transformer for Dynamic Scene Graph GenerationCode1
Disentangle Your Dense Object DetectorCode1
Feature Combination Meets Attention: Baidu Soccer Embeddings and Transformer based Temporal DetectionCode1
Can An Image Classifier Suffice For Action Recognition?Code1
Towards Long-Form Video UnderstandingCode1
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?Code1
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive LearningCode1
Show:102550
← PrevPage 35 of 115Next →

No leaderboard results yet.