SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 10761100 of 1149 papers

TitleStatusHype
Representation Flow for Action RecognitionCode0
Learnable Pooling Methods for Video ClassificationCode0
Non-local NetVLAD Encoding for Video Classification0
Large-Scale Video Classification with Feature Space Augmentation coupled with Learned Label Relations and Ensembling0
Label Denoising with Large Ensembles of Heterogeneous Neural Networks0
Localizing Moments in Video with Temporal LanguageCode0
End-to-End Joint Semantic Segmentation of Actors and Actions in Video0
Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks0
Constrained-size Tensorflow Models for YouTube-8M Video Understanding ChallengeCode0
Diagnosing Error in Temporal Action DetectorsCode0
Video Time: Properties, Encoders and Evaluation0
Query-Conditioned Three-Player Adversarial Network for Video Summarization0
When Work Matters: Transforming Classical Network Structures to Graph CNN0
Long Activity Video Understanding using Functional Object-Oriented Network0
Deep Spatio-Temporal Random Fields for Efficient Video Segmentation0
Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition0
Massively Parallel Video Networks0
What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets0
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning0
DenseImage Network: Video Spatial-Temporal Evolution Encoding and Understanding0
Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning0
Dilated Temporal Relational Adversarial Network for Generic Video Summarization0
Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos0
ECO: Efficient Convolutional Network for Online Video UnderstandingCode0
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video CaptioningCode0
Show:102550
← PrevPage 44 of 46Next →

No leaderboard results yet.