SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 1041-1050 of 1149 papers

| Title | Status | Hype |
| --- | --- | --- |
| Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations | Code | 0 |
| HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding | Code | 0 |
| The Visual Centrifuge: Model-Free Layered Video Representations | Code | 0 |
| The YouTube-8M Kaggle Competition: Challenges and Methods | Code | 0 |
| Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model | Code | 0 |
| The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge | Code | 0 |
| Hierarchical Deep Recurrent Architecture for Video Understanding | Code | 0 |
| Temporal Tessellation: A Unified Approach for Video Analysis | Code | 0 |
| Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding | Code | 0 |
| Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding | Code | 0 |

No leaderboard results yet.