SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 691-700 of 1149 papers

| Title | Status | Hype |
|---|---|---|
| VideoQA in the Era of LLMs: An Empirical Study | Code | 0 |
| LLaVA-OneVision: Easy Visual Task Transfer | Code | 0 |
| FE-Adapter: Adapting Image-based Emotion Classifiers to Videos |  | 0 |
| Multimodal Fusion and Coherence Modeling for Video Topic Segmentation |  | 0 |
| Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter |  | 0 |
| Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation |  | 0 |
| Wolf: Captioning Everything with a World Summarization Framework |  | 0 |
| Audio-visual training for improved grounding in video-text LLMs |  | 0 |
| Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data |  | 0 |
| Open Vocabulary Multi-Label Video Classification |  | 0 |

No leaderboard results yet.