SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 311320 of 1149 papers

TitleStatusHype
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across HeadsCode1
MECD+: Unlocking Event-Level Causal Graph Discovery for Video ReasoningCode1
Multimodal Long Video Modeling Based on Temporal Dynamic ContextCode1
Learning Temporally Latent Causal Processes from General Temporal DataCode1
F^3Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from VideosCode1
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual AwarenessCode1
Learning Temporally Causal Latent Processes from General Temporal DataCode1
Learning the Predictability of the FutureCode1
Learning Self-Similarity in Space and Time as a Generalized Motion for Action RecognitionCode1
End-to-End Video Instance Segmentation with TransformersCode1
Show:102550
← PrevPage 32 of 115Next →

No leaderboard results yet.