SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 11211130 of 1149 papers

TitleStatusHype
Temporal Query Networks for Fine-grained Video Understanding0
t-EVA: Time-Efficient t-SNE Video Annotation0
Text-Conditioned Resampler For Long Form Video Understanding0
TextVidBench: A Benchmark for Long Video Scene Text Understanding0
The Open World of Micro-Videos0
Therbligs in Action: Video Understanding through Motion Primitives0
The THUMOS Challenge on Action Recognition for Videos "in the Wild"0
Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders0
Time Blindness: Why Video-Language Models Can't See What Humans Can?0
TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding0
Show:102550
← PrevPage 113 of 115Next →

No leaderboard results yet.