SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1071–1080 of 1149 papers

Title	Date	Tasks	Status	Hype	Score
Selective Structured State-Spaces for Long-Form Video Understanding	Mar 25, 2023	Contrastive LearningForm	—Unverified	0	0
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization	Apr 16, 2025	HallucinationQuestion Answering	—Unverified	0	0
Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding	Mar 26, 2025	GPUQuestion Answering	—Unverified	0	0
Self-supervised Motion Representation via Scattering Local Motion Cues	Aug 1, 2020	Action RecognitionOptical Flow Estimation	—Unverified	0	0
Self-Supervised Object Detection from Egocentric Videos	Jan 1, 2023	Class-agnostic Object DetectionObject	—Unverified	0	0
Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction	Nov 28, 2018	Action RecognitionPrediction	—Unverified	0	0
Self-supervised video pretraining yields robust and more human-aligned visual representations	Oct 12, 2022	Contrastive Learningobject-detection	—Unverified	0	0
Semantics-aware Test-time Adaptation for 3D Human Pose Estimation	Feb 15, 2025	3D human pose and shape estimation3D Human Pose Estimation	—Unverified	0	0
Semantic Segmentation on VSPW Dataset through Masked Video Consistency	Jun 7, 2024	Semantic SegmentationVideo Understanding	—Unverified	0	0
Semi-Parametric Video-Grounded Text Generation	Jan 27, 2023	Language ModelingLanguage Modelling	—Unverified	0	0

Show:10 25 50

← PrevPage 108 of 115Next →

No leaderboard results yet.