SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 211220 of 1149 papers

TitleStatusHype
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action DetectionCode3
M-LLM Based Video Frame Selection for Efficient Video Understanding0
InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model0
An Analysis of Data Transformation Effects on Segment Anything 20
Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric VideosCode1
Fine-Grained Video Captioning through Scene Graph Consolidation0
LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models0
AVD2: Accident Video Diffusion for Accident Video Description0
MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval0
iMOVE: Instance-Motion-Aware Video Understanding0
Show:102550
← PrevPage 22 of 115Next →

No leaderboard results yet.