SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 981–990 of 1149 papers

Title | Status | Hype
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework | Code | 0
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers | | 0
Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation | | 0
Unified Graph Structured Models for Video Understanding | | 0
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization | | 0
ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation | | 0
Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training | | 0
PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization | | 0
Unsupervised Motion Representation Enhanced Network for Action Recognition | | 0
Win-Fail Action Recognition | Code | 0

No leaderboard results yet.