SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 111-120 of 1149 papers

Title | Status | Hype
LVBench: An Extreme Long Video Understanding Benchmark | Code | 2
Vript: A Video Is Worth Thousands of Words | Code | 2
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Code | 2
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos | Code | 2
Dense Connector for MLLMs | Code | 2
Vision Mamba: A Comprehensive Survey and Taxonomy | Code | 2
Foundation Models for Video Understanding: A Survey | Code | 2
Leveraging Temporal Contextualization for Video Action Recognition | Code | 2
LongVLM: Efficient Long Video Understanding via Large Language Models | Code | 2
ST-LLM: Large Language Models Are Effective Temporal Learners | Code | 2

No leaderboard results yet.