SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 10111020 of 1149 papers

TitleStatusHype
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection0
Zero-shot Action Localization via the Confidence of Large Vision-Language Models0
Zero-Shot Action Recognition in Surveillance Videos0
Zero-Shot Action Recognition in Videos: A Survey0
Zero-Shot Long-Form Video Understanding through Screenplay0
Zero-shot Shark Tracking and Biometrics from Aerial Imagery0
Hierarchical Video Frame Sequence Representation with Deep Convolutional Graph Network0
4D Generic Video Object ProposalsCode0
LMM-VQA: Advancing Video Quality Assessment with Large Multimodal ModelsCode0
LLaVA-OneVision: Easy Visual Task TransferCode0
Show:102550
← PrevPage 102 of 115Next →

No leaderboard results yet.