SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 771780 of 1149 papers

TitleStatusHype
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering0
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding0
DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding0
EAGLE: Egocentric AGgregated Language-video Engine0
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey0
Efficient Modelling Across Time of Human Actions and Interactions0
Efficient Motion-Aware Video MLLM0
Efficient Video Understanding via Layered Multi Frame-Rate Analysis0
EgoEnv: Human-centric environment representations from egocentric video0
Egocentric Video Task Translation0
Show:102550
← PrevPage 78 of 115Next →

No leaderboard results yet.