Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 931–940 of 1149 papers

Title	Date	Tasks	Status	Hype
UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection	Nov 29, 2021	Boundary Detection, Contrastive Learning	Unverified	0
UBoCo: Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection	Jan 1, 2022	Boundary Detection, Contrastive Learning	Unverified	0
Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges	Sep 25, 2023	Anomaly Detection, Dense Video Captioning	Unverified	0
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks	Mar 24, 2025	Common Sense Reasoning, Prediction	Unverified	0
Understanding Action Sequences based on Video Captioning for Learning-from-Observation	Dec 9, 2020	Video Captioning, Video Understanding	Unverified	0
Understanding Long Videos via LLM-Powered Entity Relation Graphs	Jan 27, 2025	EgoSchema, Large Language Model	Unverified	0
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation	Apr 10, 2021	Object, object-detection	Unverified	0
UniDual: A Unified Model for Image and Video Understanding	Jun 10, 2019	Multi-Task Learning, Video Understanding	Unverified	0
Unified Graph Structured Models for Video Understanding	Mar 29, 2021	Action Detection, Graph Classification	Unverified	0
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action	Jan 1, 2024	Image Generation, Instruction Following	Unverified	0

No leaderboard results yet.