SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 491500 of 1149 papers

TitleStatusHype
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval0
Learning Dynamic MRI Reconstruction with Convolutional Network Assisted Reconstruction Swin Transformer0
Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking0
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment0
Learning from Multiple Sources for Video Summarisation0
Learning Higher-order Object Interactions for Keypoint-based Video Understanding0
Inductive Attention for Video Action Anticipation0
Discrete neural representations for explainable anomaly detection0
Improving Video Model Transfer With Dynamic Representation Learning0
Improving LLM Video Understanding with 16 Frames Per Second0
Show:102550
← PrevPage 50 of 115Next →

No leaderboard results yet.