
Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 1021-1030 of 1149 papers

| Title | Status | Hype |
| --- | --- | --- |
| PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization | | 0 |
| Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries | | 0 |
| Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark | | 0 |
| Personalized Video Summarization by Multimodal Video Understanding | | 0 |
| Person Count Localization in Videos From Noisy Foreground and Detections | | 0 |
| PEVLM: Parallel Encoding for Vision-Language Models | | 0 |
| PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval | | 0 |
| Principles of Visual Tokens for Efficient Video Understanding | | 0 |
| ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab | | 0 |
| Progress-Aware Video Frame Captioning | | 0 |

No leaderboard results yet.