SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 671–680 of 1149 papers

Title | Status | Hype
VideoGLUE: Video General Understanding Evaluation of Foundation Models | | 0
ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models | Code | 0
Temporal Action Proposal Generation With Action Frequency Adaptive Network | Code | 0
An overview on the evaluated video retrieval tasks at TRECVID 2022 | Code | 1
Multi-Granularity Hand Action Detection | Code | 1
Learning Space-Time Semantic Correspondences | | 0
EPIC Fields: Marrying 3D Geometry and Video Understanding | Code | 1
Valley: Video Assistant with Large Language model Enhanced abilitY | Code | 2
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models | Code | 3
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment | | 0

No leaderboard results yet.