SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 611620 of 1149 papers

TitleStatusHype
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection0
Zero-shot Action Localization via the Confidence of Large Vision-Language Models0
Zero-Shot Action Recognition in Surveillance Videos0
Zero-Shot Action Recognition in Videos: A Survey0
Zero-Shot Long-Form Video Understanding through Screenplay0
Zero-shot Shark Tracking and Biometrics from Aerial Imagery0
Hierarchical Video Frame Sequence Representation with Deep Convolutional Graph Network0
Zero-Shot Video Question Answering with Procedural Programs0
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation0
Multimodal Fusion and Coherence Modeling for Video Topic Segmentation0
Show:102550
← PrevPage 62 of 115Next →

No leaderboard results yet.