SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1041–1050 of 1149 papers

Title	Date	Tasks	Status	Hype
Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations	Mar 25, 2025	Representation LearningVideo Understanding	CodeCode Available	0
HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding	Jan 3, 2025	Question AnsweringVideo Understanding	CodeCode Available	0
The Visual Centrifuge: Model-Free Layered Video Representations	Dec 4, 2018	Color Constancymodel	CodeCode Available	0
The YouTube-8M Kaggle Competition: Challenges and Methods	Jun 28, 2017	General ClassificationVideo Classification	CodeCode Available	0
Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model	Jun 15, 2024	Question AnsweringVideo Understanding	CodeCode Available	0
The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge	Jun 16, 2017	General ClassificationVideo Classification	CodeCode Available	0
Hierarchical Deep Recurrent Architecture for Video Understanding	Jul 11, 2017	ClassificationGeneral Classification	CodeCode Available	0
Temporal Tessellation: A Unified Approach for Video Analysis	Dec 21, 2016	Action DetectionVideo Captioning	CodeCode Available	0
Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding	May 19, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding	Jul 14, 2017	Video RecognitionVideo Understanding	CodeCode Available	0

Show:10 25 50

← PrevPage 105 of 115Next →

No leaderboard results yet.