SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 911920 of 1149 papers

TitleStatusHype
Adversarial Machine Learning Attacks Against Video Anomaly Detection Systems0
MM-SEAL: A Large-scale Video Dataset of Multi-person Multi-grained Spatio-temporally Action Localization0
PYSKL: a toolbox for skeleton-based video understanding0
FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding TasksCode0
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow AnalysisCode0
Human Gaze Guided Attention for Surgical Activity Recognition0
Multi-Scale Self-Contrastive Learning with Hard Negative Mining for Weakly-Supervised Query-based Video Grounding0
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection0
Concept Graph Neural Networks for Surgical Video Understanding0
Audio Visual Scene-Aware Dialog Generation with Transformer-based Video Representations0
Show:102550
← PrevPage 92 of 115Next →

No leaderboard results yet.