SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 771–780 of 1149 papers

Title	Date	Tasks	Status	Hype	Score
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering	Jul 1, 2022	Question AnsweringVideo Question Answering	—Unverified	0	0
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding	Nov 19, 2024	Question AnsweringVideo Understanding	—Unverified	0	0
DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding	Jun 4, 2025	MMEVideo MME	—Unverified	0	0
EAGLE: Egocentric AGgregated Language-video Engine	Sep 26, 2024	Action RecognitionActivity Recognition	—Unverified	0	0
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey	Jun 5, 2022	3D Hand Pose EstimationDomain Adaptation	—Unverified	0	0
Efficient Modelling Across Time of Human Actions and Interactions	Oct 5, 2021	Action RecognitionVideo Understanding	—Unverified	0	0
Efficient Motion-Aware Video MLLM	Jan 1, 2025	Question AnsweringVideo Question Answering	—Unverified	0	0
Efficient Video Understanding via Layered Multi Frame-Rate Analysis	Nov 24, 2018	Autonomous DrivingVideo Understanding	—Unverified	0	0
EgoEnv: Human-centric environment representations from egocentric video	Jul 22, 2022	Video Understanding	—Unverified	0	0
Egocentric Video Task Translation	Dec 13, 2022	Multi-Task LearningTranslation	—Unverified	0	0

Show:10 25 50

← PrevPage 78 of 115Next →

No leaderboard results yet.