SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 571580 of 1149 papers

TitleStatusHype
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges0
VideoLLM Benchmarks and Evaluation: A Survey0
VideoMCC: a New Benchmark for Video Comprehension0
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition0
VideoPrism: A Foundational Visual Encoder for Video Understanding0
Videoprompter: an ensemble of foundational models for zero-shot video understanding0
Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling0
Video RWKV:Video Action Recognition Based RWKV0
VideoSAVi: Self-Aligned Video Language Models without Human Supervision0
VideoScan: Enabling Efficient Streaming Video Understanding via Frame-level Semantic Carriers0
Show:102550
← PrevPage 58 of 115Next →

No leaderboard results yet.