SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 491500 of 1149 papers

TitleStatusHype
Vript: A Video Is Worth Thousands of WordsCode2
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation0
Semantic Segmentation on VSPW Dataset through Masked Video Consistency0
ShareGPT4Video: Improving Video Understanding and Generation with Better CaptionsCode5
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation0
MLVU: Benchmarking Multi-task Long Video UnderstandingCode3
Contrastive Language Video Time Pre-training0
Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric VideosCode1
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model0
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation0
Show:102550
← PrevPage 50 of 115Next →

No leaderboard results yet.