SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 831-840 of 1149 papers

Title | Status | Hype
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning | | 0
Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning | Code | 0
Action Sensitivity Learning for Temporal Action Localization | | 0
Learning Higher-order Object Interactions for Keypoint-based Video Understanding | | 0
A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot | Code | 0
Vehicle Detection and Classification without Residual Calculation: Accelerating HEVC Image Decoding with Random Perturbation Injection | | 0
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System | | 0
MRSN: Multi-Relation Support Network for Video Action Detection | | 0
Search-Map-Search: A Frame Selection Paradigm for Action Recognition | | 0
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision | | 0

No leaderboard results yet.