Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 271–280 of 1149 papers

Title	Date	Tasks	Status	Hype
Transformer-Based Model for Monocular Visual Odometry: A Video Understanding Approach	May 10, 2023	Autonomous Vehicles, Monocular Visual Odometry	Code Available	1
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer	Apr 29, 2023	Decoder, Highlight Detection	Code Available	1
Event-Free Moving Object Segmentation from Moving Ego Vehicle	Apr 28, 2023	Autonomous Driving, Benchmarking	Code Available	1
Leveraging triplet loss for unsupervised action segmentation	Apr 13, 2023	Action Segmentation, Clustering	Code Available	1
Procedure-Aware Pretraining for Instructional Video Understanding	Mar 31, 2023	Video Understanding	Code Available	1
Whether and When does Endoscopy Domain Pretraining Make Sense?	Mar 30, 2023	Action Triplet Detection, Surgical phase recognition	Code Available	1
Streaming Video Model	Mar 30, 2023	Action Recognition, Decoder	Code Available	1
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition	Mar 28, 2023	Action Recognition, Optical Flow Estimation	Code Available	1
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos	Mar 22, 2023	Representation Learning, Sentence	Code Available	1
Dual-path Adaptation from Image to Video Transformers	Mar 17, 2023	Action Classification, Action Recognition	Code Available	1

No leaderboard results yet.