Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 941-950 of 1149 papers

Title | Status | Hype
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation | - | 0
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework | Code | 0
TubeR: Tubelet Transformer for Video Action Detection | Code | 1
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers | - | 0
Visual Semantic Role Labeling for Video Understanding | Code | 1
Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation | - | 0
Unified Graph Structured Models for Video Understanding | - | 0
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization | - | 0
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization | Code | 1
Temporal Context Aggregation Network for Temporal Action Proposal Refinement | Code | 1

No leaderboard results yet.