SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 281-290 of 1149 papers

Title | Status | Hype
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner | Code | 1
Large Scale Holistic Video Understanding | Code | 1
Revisiting spatio-temporal layouts for compositional action recognition | Code | 1
Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task Perspectives | Code | 1
A Simple LLM Framework for Long-Range Video Question-Answering | Code | 1
ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding | Code | 1
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction | Code | 1
A Dataset for Medical Instructional Video Classification and Question Answering | Code | 1
Self-Adaptive Sampling for Efficient Video Question-Answering on Image--Text Models | Code | 1
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning | Code | 1

No leaderboard results yet.