SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 11411149 of 1149 papers

TitleStatusHype
METok: Multi-Stage Event-based Token Compression for Efficient Long Video UnderstandingCode0
Constrained-size Tensorflow Models for YouTube-8M Video Understanding ChallengeCode0
Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022Code0
Context R-CNN: Long Term Temporal Context for Per-Camera Object DetectionCode0
SoccerDB: A Large-Scale Database for Comprehensive Video UnderstandingCode0
Video Action UnderstandingCode0
VURF: A General-purpose Reasoning and Self-refinement Framework for Video UnderstandingCode0
Long-Term Feature Banks for Detailed Video UnderstandingCode0
Localizing Moments in Video with Temporal LanguageCode0
Show:102550
← PrevPage 115 of 115Next →

No leaderboard results yet.