SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 10711080 of 1149 papers

TitleStatusHype
Selective Structured State-Spaces for Long-Form Video Understanding0
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization0
Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding0
Self-supervised Motion Representation via Scattering Local Motion Cues0
Self-Supervised Object Detection from Egocentric Videos0
Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction0
Self-supervised video pretraining yields robust and more human-aligned visual representations0
Semantics-aware Test-time Adaptation for 3D Human Pose Estimation0
Semantic Segmentation on VSPW Dataset through Masked Video Consistency0
Semi-Parametric Video-Grounded Text Generation0
Show:102550
← PrevPage 108 of 115Next →

No leaderboard results yet.