SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 11311140 of 1149 papers

TitleStatusHype
Audio Caption in a Car Setting with a Sentence-Level LossCode0
VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language ModelsCode0
VideoDG: Generalizing Temporal Relations in Videos to Novel DomainsCode0
Detect-and-Track: Efficient Pose Estimation in VideosCode0
MINOTAUR: Multi-task Video Grounding From Multimodal QueriesCode0
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video UnderstandingCode0
Deep Learning Methods for Efficient Large Scale Video LabelingCode0
Creative Flow+ DatasetCode0
Contextual Explainable Video Representation: Human Perception-based UnderstandingCode0
A Challenge to Build Neuro-Symbolic Video AgentsCode0
Show:102550
← PrevPage 114 of 115Next →

No leaderboard results yet.