Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 1101–1110 of 1149 papers

Title	Date	Tasks	Status	Hype	Score
StimuVAR: Spatiotemporal Stimuli-aware Video Affective Reasoning with Multimodal Large Language Models	Aug 31, 2024	Video Understanding	Unverified	0	0
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition	Jan 8, 2023	Action Recognition, Facial Expression Recognition (FER)	Unverified	0	0
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant	May 8, 2025	Language Modeling, Language Modelling	Unverified	0	0
Streaming Long Video Understanding with Large Language Models	May 25, 2024	Question Answering, Video Understanding	Unverified	0	0
Streamlining Forest Wildfire Surveillance: AI-Enhanced UAVs Utilizing the FLAME Aerial Video Dataset for Lightweight and Efficient Monitoring	Aug 31, 2024	Disaster Response, Video Understanding	Unverified	0	0
Students taught by multimodal teachers are superior action recognizers	Oct 9, 2022	Action Recognition, Knowledge Distillation	Unverified	0	0
Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding	Jun 9, 2025	Contrastive Learning, Video Editing	Unverified	0	0
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis	Jun 9, 2025	Action Classification, Benchmarking	Unverified	0	0
SVGraph: Learning Semantic Graphs from Instructional Videos	Jul 16, 2022	Graph Learning, Video Understanding	Unverified	0	0
SVT: Supertoken Video Transformer for Efficient Video Understanding	Apr 1, 2023	Video Understanding	Unverified	0	0

No leaderboard results yet.