SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 281–290 of 1149 papers

Title | Status | Hype
Lightweight Network Architecture for Real-Time Action Recognition | Code | 1
A Simple LLM Framework for Long-Range Video Question-Answering | Code | 1
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction | Code | 1
A Dataset for Medical Instructional Video Classification and Question Answering | Code | 1
Learning the Predictability of the Future | Code | 1
Learning Temporally Latent Causal Processes from General Temporal Data | Code | 1
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge | Code | 1
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning | Code | 1
CAST: Cross-Attention in Space and Time for Video Action Recognition | Code | 1
Towards Visually Explaining Video Understanding Networks with Perturbation | Code | 1

No leaderboard results yet.