SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 971980 of 1149 papers

TitleStatusHype
Highlight Timestamp Detection Model for Comedy Videos via Multimodal Sentiment Analysis0
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding0
Relation-aware Hierarchical Attention Framework for Video Question AnsweringCode0
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions0
Skimming and Scanning for Untrimmed Video Action Recognition0
Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting0
Temporal Query Networks for Fine-grained Video Understanding0
Temporally smooth online action detection using cycle-consistent future anticipationCode0
Adaptive Intermediate Representations for Video Understanding0
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation0
Show:102550
← PrevPage 98 of 115Next →

No leaderboard results yet.