SOTAVerified|Agents Browse Leaderboard About Blog

Video Description

The goal of automatic Video Description is to tell a story about events happening in a video. While early Video Description methods produced captions for short clips that were manually segmented to contain a single event of interest, more recently dense video captioning has been proposed to both segment distinct events in time and describe them in a series of coherent sentences. This problem is a generalization of dense image region captioning and has many practical applications, such as generating textual summaries for the visually impaired, or detecting and describing important events in surveillance footage.

Source: Joint Event Detection and Description in Continuous Video Streams

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 61–70 of 104 papers

Title	Date	Tasks	Status	Hype
Incorporating Background Knowledge into Video Description Generation	Oct 1, 2018	DecoderText Generation	—Unverified	0
A Dataset for Telling the Stories of Social Media Videos	Oct 1, 2018	SentenceVideo Captioning	—Unverified	0
Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions	Aug 27, 2018	TranslationVideo Description	—Unverified	0
Bridge Video and Text with Cascade Syntactic Structure	Aug 1, 2018	AttributeObject	—Unverified	0
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data	Jul 1, 2018	Image DescriptionMachine Translation	—Unverified	0
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features	Jun 21, 2018	Question AnsweringVideo Description	CodeCode Available	0
Interpretable Video Captioning via Trajectory Structured Localization	Jun 1, 2018	DecoderImage Captioning	—Unverified	0
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7	Jun 1, 2018	Video DescriptionVisual Dialog	CodeCode Available	1
Video Description: A Survey of Methods, Datasets and Evaluation Metrics	Jun 1, 2018	DiversityLanguage Modeling	—Unverified	0
Incorporating Semantic Attention in Video Description Generation	May 1, 2018	Image CaptioningImage Classification	—Unverified	0

Show:10 25 50

← PrevPage 7 of 11Next →

No leaderboard results yet.