SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 331340 of 1149 papers

TitleStatusHype
End-to-End Streaming Video Temporal Action Segmentation with Reinforce LearningCode1
Open-Vocabulary Video Relation ExtractionCode1
Panoptic Video Scene Graph GenerationCode1
Panoramic Vision Transformer for Saliency Detection in 360° VideosCode1
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action RecognitionCode1
Learning the Predictability of the FutureCode1
Localizing Moments in Long Video Via Multimodal GuidanceCode1
Language Repository for Long Video UnderstandingCode1
CAMEL-Bench: A Comprehensive Arabic LMM BenchmarkCode1
Show:102550
← PrevPage 34 of 115Next →

No leaderboard results yet.