SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 381390 of 1149 papers

TitleStatusHype
Temporally-Weighted Hierarchical Clustering for Unsupervised Action SegmentationCode1
TSM: Temporal Shift Module for Efficient Video UnderstandingCode1
HAT: History-Augmented Anchor Transformer for Online Temporal Action LocalizationCode1
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal TokensCode1
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video UnderstandingCode1
Large Scale Holistic Video UnderstandingCode1
REVECA -- Rich Encoder-decoder framework for Video Event CAptionerCode1
Relaxed Transformer Decoders for Direct Action Proposal GenerationCode1
MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss AlpsCode1
TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in ActionCode1
Show:102550
← PrevPage 39 of 115Next →

No leaderboard results yet.