SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 921930 of 1149 papers

TitleStatusHype
Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking0
Technical Report: Temporal Aggregate RepresentationsCode1
Transformed ROIs for Capturing Visual Transformations in Videos0
A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP0
Highlight Timestamp Detection Model for Comedy Videos via Multimodal Sentiment Analysis0
FineAction: A Fine-Grained Video Dataset for Temporal Action LocalizationCode1
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding0
NExT-QA:Next Phase of Question-Answering to Explaining Temporal ActionsCode1
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports ActionsCode1
Relation-aware Hierarchical Attention Framework for Video Question AnsweringCode0
Show:102550
← PrevPage 93 of 115Next →

No leaderboard results yet.