SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 701710 of 1149 papers

TitleStatusHype
Extending Video Masked Autoencoders to 128 frames0
Extensible Hierarchical Method of Detecting Interactive Actions for Video Understanding0
Real-Time Segmentation Networks should be Latency Aware0
Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning0
FaVChat: Unlocking Fine-Grained Facail Video Understanding with Multimodal Large Language Models0
FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding0
Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models0
Fill-in-the-Blank: A Challenging Video Understanding Evaluation Framework0
Fine-Grain Annotation of Cricket Videos0
Fine-Grained Video Captioning through Scene Graph Consolidation0
Show:102550
← PrevPage 71 of 115Next →

No leaderboard results yet.