SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 431440 of 1149 papers

TitleStatusHype
VideoQA in the Era of LLMs: An Empirical StudyCode0
LLaVA-OneVision: Easy Visual Task TransferCode0
FE-Adapter: Adapting Image-based Emotion Classifiers to Videos0
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language BenchmarkCode1
Multimodal Fusion and Coherence Modeling for Video Topic Segmentation0
Segment Anything for Videos: A Systematic SurveyCode5
Learning Video Context as Interleaved Multimodal SequencesCode1
Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter0
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation0
Wolf: Captioning Everything with a World Summarization Framework0
Show:102550
← PrevPage 44 of 115Next →

No leaderboard results yet.