
Video Description

The goal of automatic Video Description is to tell a story about events happening in a video. While early Video Description methods produced captions for short clips that were manually segmented to contain a single event of interest, more recently dense video captioning has been proposed to both segment distinct events in time and describe them in a series of coherent sentences. This problem is a generalization of dense image region captioning and has many practical applications, such as generating textual summaries for the visually impaired, or detecting and describing important events in surveillance footage.

Source: Joint Event Detection and Description in Continuous Video Streams

Papers


Showing 11–20 of 104 papers

Title	Date	Tasks	Status	Hype
Identity-Aware Multi-Sentence Video Description	Aug 22, 2020	Gender Prediction, Sentence	Code Available	1
Delving Deeper into the Decoder for Video Captioning	Jan 16, 2020	Decoder, Sentence	Code Available	1
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research	Apr 6, 2019	Machine Translation, Translation	Code Available	1
Grounded Video Description	Dec 17, 2018	Image Description, Sentence	Code Available	1
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7	Jun 1, 2018	Video Description, Visual Dialog	Code Available	1
Using Descriptive Video Services to Create a Large Data Source for Video Annotation Research	Mar 3, 2015	Descriptive, Video Description	Code Available	1
DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description	Mar 31, 2025	Video Description, Video Understanding	Unverified	0
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation	Mar 31, 2025	Hallucination, Human-Object Interaction Detection	Unverified	0
Cross-Modal Learning for Music-to-Music-Video Description Generation	Mar 14, 2025	Video Description, Video Generation	Unverified	0
VideoA11y: Method and Dataset for Accessible Video Description	Feb 27, 2025	Video Description	Unverified	0


No leaderboard results yet.