SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 351360 of 1149 papers

TitleStatusHype
An overview on the evaluated video retrieval tasks at TRECVID 2022Code1
Free Lunch for Surgical Video Understanding by Distilling Self-SupervisionsCode1
-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory ConsolidationCode1
Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric VideosCode1
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language ModelsCode1
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment GroundingCode1
How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary InvestigationCode1
A Comprehensive Study of Deep Video Action RecognitionCode1
Elaborative Rehearsal for Zero-shot Action RecognitionCode1
How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?Code1
Show:102550
← PrevPage 36 of 115Next →

No leaderboard results yet.