SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 926950 of 1149 papers

TitleStatusHype
FineAction: A Fine-Grained Video Dataset for Temporal Action LocalizationCode1
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding0
NExT-QA:Next Phase of Question-Answering to Explaining Temporal ActionsCode1
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports ActionsCode1
Relation-aware Hierarchical Attention Framework for Video Question AnsweringCode0
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions0
Stochastic Image-to-Video Synthesis using cINNsCode1
FrameExit: Conditional Early Exiting for Efficient Video RecognitionCode1
Skimming and Scanning for Untrimmed Video Action Recognition0
Temporal Query Networks for Fine-grained Video Understanding0
Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting0
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip RetrievalCode1
Temporally smooth online action detection using cycle-consistent future anticipationCode0
Adaptive Intermediate Representations for Video Understanding0
Crossover Learning for Fast Online Video Instance SegmentationCode1
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation0
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation FrameworkCode0
TubeR: Tubelet Transformer for Video Action DetectionCode1
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers0
Visual Semantic Role Labeling for Video UnderstandingCode1
Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation0
Unified Graph Structured Models for Video Understanding0
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization0
Learning Salient Boundary Feature for Anchor-free Temporal Action LocalizationCode1
Temporal Context Aggregation Network for Temporal Action Proposal RefinementCode1
Show:102550
← PrevPage 38 of 46Next →

No leaderboard results yet.