SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–610 of 1149 papers

Title	Date	Tasks	Status	Hype	Score
What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation?	Mar 20, 2025	DecoderGraph Generation	—Unverified	0	0
What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets	Jun 1, 2018	Video Understanding	—Unverified	0	0
When Work Matters: Transforming Classical Network Structures to Graph CNN	Jul 7, 2018	Graph ClassificationVideo Understanding	—Unverified	0	0
WildQA: In-the-Wild Video Question Answering	Sep 14, 2022	Evidence SelectionQuestion Answering	—Unverified	0	0
Wolf: Captioning Everything with a World Summarization Framework	Jul 26, 2024	Autonomous DrivingMixture-of-Experts	—Unverified	0	0
WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning	May 6, 2024	Multiple-choiceVideo Understanding	—Unverified	0	0
WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs	Feb 6, 2025	Video Understanding	—Unverified	0	0
X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding	Jan 12, 2025	Video Understanding	—Unverified	0	0
YouMVOS: An Actor-Centric Multi-Shot Video Object Segmentation Dataset	Jan 1, 2022	ManagementSegmentation	—Unverified	0	0
YouTube-8M Video Understanding Challenge Approach and Applications	Jun 26, 2017	Ensemble LearningVideo Understanding	—Unverified	0	0

Show:10 25 50

← PrevPage 61 of 115Next →

No leaderboard results yet.