SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 10911100 of 1149 papers

TitleStatusHype
Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition0
Massively Parallel Video Networks0
What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets0
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning0
DenseImage Network: Video Spatial-Temporal Evolution Encoding and Understanding0
Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning0
Dilated Temporal Relational Adversarial Network for Generic Video Summarization0
Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos0
ECO: Efficient Convolutional Network for Online Video UnderstandingCode0
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video CaptioningCode0
Show:102550
← PrevPage 110 of 115Next →

No leaderboard results yet.