SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 10761100 of 1149 papers

TitleStatusHype
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object UnderstandingCode0
Snippet-Aware Transformer With Multiple Action Elements for Skeleton-Based Action SegmentationCode0
Features Understanding in 3D CNNs for Actions Recognition in VideoCode0
Situational Scene Graph for Structured Human-centric Situation UnderstandingCode0
Exploring Temporal Information for Improved Video UnderstandingCode0
SeriesBench: A Benchmark for Narrative-Driven Drama Series UnderstandingCode0
ScVLM: Enhancing Vision-Language Model for Safety-Critical Event UnderstandingCode0
Exploiting Long-Term Dependencies for Generating Dynamic Scene GraphsCode0
Screencast Tutorial Video UnderstandingCode0
Video Object Segmentation using Supervoxel-Based GerrymanderingCode0
ScaleLong: A Multi-Timescale Benchmark for Long Video UnderstandingCode0
Representation Flow for Action RecognitionCode0
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity RecognitionCode0
Relation-aware Hierarchical Attention Framework for Video Question AnsweringCode0
Re-ID-AR: Improved Person Re-identification in Video via Joint Weakly Supervised Action RecognitionCode0
Recurrent Space-time Graph Neural NetworksCode0
TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic VideosCode0
ACVUBench: Audio-Centric Video Understanding BenchmarkCode0
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video ArchitecturesCode0
Win-Fail Action RecognitionCode0
VideoQA in the Era of LLMs: An Empirical StudyCode0
UAL-Bench: The First Comprehensive Unusual Activity Localization BenchmarkCode0
ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence AlignmentCode0
EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric OptimizationCode0
Enhancing Temporal Modeling of Video LLMs via Time GatingCode0
Show:102550
← PrevPage 44 of 46Next →

No leaderboard results yet.