SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 91-100 of 1149 papers

Title | Status | Hype
Gameplay Highlights Generation | - | 0
Seed1.5-VL Technical Report | - | 0
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant | - | 0
RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph | - | 0
Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Code | 1
VideoLLM Benchmarks and Evaluation: A Survey | - | 0
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding | Code | 1
TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action | Code | 1
Empowering Agentic Video Analytics Systems with Video Language Models | - | 0
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding | Code | 0

No leaderboard results yet.