SOTAVerified

Video Summarization

Video Summarization aims to generate a short synopsis that summarizes the video content by selecting its most informative and important parts. The produced summary is usually composed of a set of representative video frames (a.k.a. video key-frames), or video fragments (a.k.a. video key-fragments) that have been stitched in chronological order to form a shorter video. The former type of a video summary is known as video storyboard, and the latter type is known as video skim.

Source: Video Summarization Using Deep Neural Networks: A Survey Image credit: iJRASET

Papers

Showing 251280 of 280 papers

TitleStatusHype
From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection0
Scaling Submodular Maximization via Pruned Submodularity Graphs0
Stream Clipper: Scalable Submodular Maximization on Stream0
Video Summarization with Long Short-term MemoryCode0
Long-Term Identity-Aware Multi-Person Tracking for Surveillance Video Summarization0
Detecting Engagement in Egocentric Video0
Summary Transfer: Exemplar-based Subset Selection for Video Summarization0
Discovery of Shared Semantic Spaces for Multi-Scene Video Query and Summarization0
TVSum: Summarizing Web Videos Using Titles0
Video Co-Summarization: Video Summarization by Visual Co-Occurrence0
Video Summarization by Learning Submodular Mixtures of Objectives0
Gaze-Enabled Egocentric Video Summarization via Constrained Submodular Maximization0
Predicting Important Objects for Egocentric Video Summarization0
Visual Recognition by Counting Instances: A Multi-Instance Cardinality Potential Kernel0
Co-Regularized Deep Representations for Video Summarization0
A Survey on Recent Advances of Computer Vision Algorithms for Egocentric Video0
Diverse Sequential Subset Selection for Supervised Video Summarization0
Large-Margin Determinantal Point Processes0
Attentive monitoring of multiple video streams driven by a Bayesian foraging strategy0
Temporally Coherent Bayesian Models for Entity Discovery in Videos by Tracklet Clustering0
Semantics for Large-Scale Multimedia: New Challenges for NLP0
Joint Summarization of Large-scale Collections of Web Images and Videos for Storyline Reconstruction0
Multi-view Metric Learning for Multi-view Video Summarization0
Representative Selection for Big Data via Sparse Graph and Geodesic Grassmann Manifold Distance0
VSCAN: An Enhanced Video Summarization using Density-based Spatial Clustering0
An Enhanced Method For Evaluating Automatic Video Summaries0
Creating Summaries from User Videos0
Story-Driven Summarization for Egocentric Video0
Large-Scale Video Summarization Using Web-Image Priors0
Generating Natural Language Summaries for Multimedia0
Show:102550
← PrevPage 6 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PGL-SUMF1-score (Canonical)55.6Unverified
2RR-STGF1-score (Canonical)54.5Unverified
3DSNetF1-score (Canonical)53Unverified
4VASNetF1-score (Canonical)49.71Unverified
5M-AVSF1-score (Canonical)44.4Unverified
6CSTAKendall's Tau0.25Unverified
#ModelMetricClaimedVerifiedStatus
1RR-STGF1-score (Canonical)63Unverified
2DSNetF1-score (Canonical)62.1Unverified
3VASNetF1-score (Canonical)61.42Unverified
4PGL-SUMF1-score (Canonical)61Unverified
5M-AVSF1-score (Canonical)61Unverified
6CSTAKendall's Tau0.19Unverified
#ModelMetricClaimedVerifiedStatus
1Shotluck-Holmes (3.1B)CIDEr152.3Unverified
2Shotluck-Holmes (3.1B)CIDEr63.2Unverified
3SUM-shotCIDEr8.6Unverified
#ModelMetricClaimedVerifiedStatus
1EgoVLPv2F1 (avg)52.08Unverified
2EgoVLPF1 (avg)49.72Unverified
#ModelMetricClaimedVerifiedStatus
1PGL-SUMMAP (50%)61.6Unverified
#ModelMetricClaimedVerifiedStatus
1VTSUM-BLIP1 shot Micro-F123.5Unverified