SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 721730 of 1149 papers

TitleStatusHype
From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction0
From Image to Video, what do we need in multimodal LLMs?0
From Shots to Stories: LLM-Assisted Video Editing with Unified Language Representations0
From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment0
Fully Automated Hand Hygiene Monitoring\ Operating Room using 3D Convolutional Neural Network0
Future semantic segmentation of time-lapsed videos with large temporal displacement0
Gameplay Highlights Generation0
Gaze-Guided Graph Neural Network for Action Anticipation Conditioned on Intention0
Generating the Future With Adversarial Transformers0
Generating Videos with Scene Dynamics0
Show:102550
← PrevPage 73 of 115Next →

No leaderboard results yet.