SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 931940 of 1149 papers

TitleStatusHype
UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection0
UBoCo: Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection0
Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges0
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks0
Understanding Action Sequences based on Video Captioning for Learning-from-Observation0
Understanding Long Videos via LLM-Powered Entity Relation Graphs0
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation0
UniDual: A Unified Model for Image and Video Understanding0
Unified Graph Structured Models for Video Understanding0
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action0
Show:102550
← PrevPage 94 of 115Next →

No leaderboard results yet.