SOTAVerified

Video Summarization

Video Summarization aims to generate a short synopsis that summarizes the video content by selecting its most informative and important parts. The produced summary is usually composed of a set of representative video frames (a.k.a. video key-frames), or video fragments (a.k.a. video key-fragments) that have been stitched in chronological order to form a shorter video. The former type of a video summary is known as video storyboard, and the latter type is known as video skim.

Source: Video Summarization Using Deep Neural Networks: A Survey Image credit: iJRASET

Papers

Showing 201250 of 280 papers

TitleStatusHype
A Novel Technique for Evidence based Conditional Inference in Deep Neural Networks via Latent Feature Perturbation0
Discriminative Feature Learning for Unsupervised Video SummarizationCode0
A Framework towards Domain Specific Video Summarization0
Vis-DSS: An Open-Source toolkit for Visual Data Selection and SummarizationCode0
A Dataset and Preliminary Results for Umpire Pose Detection Using SVM Classification of Deep Features0
Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing0
Diverse and Coherent Paragraph Generation from Images0
Retrospective Encoders for Video Summarization0
Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web PriorCode0
Improving Sequential Determinantal Point Processes for Supervised Video Summarization0
Query-Conditioned Three-Player Adversarial Network for Video Summarization0
How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video Summarization0
HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization0
A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos0
Video Summarization by Learning from Unpaired Data0
Video Summarization Using Fully Convolutional Sequence Networks0
FFNet: Video Fast-Forwarding via Reinforcement LearningCode0
A Memory Network Approach for Story-based Temporal Summarization of 360° Videos0
Dilated Temporal Relational Adversarial Network for Generic Video Summarization0
Viewpoint-aware Video Summarization0
Segmentation of Bleeding Regions in Wireless Capsule Endoscopy Images an Approach for inside Capsule Video Summarization0
Do Less, Get More: Streaming Submodular Maximization with Subsampling0
Unsupervised Object-Level Video Summarization with Online Motion Auto-Encoder0
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness RewardCode0
Subset Selection and Summarization in Sequential Data0
Common Action Discovery and Localization in Unconstrained Videos0
Summarization of User-Generated Sports Video by Using Deep Action Recognition Features0
Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video0
Video Summarization with Attention-Based Encoder-Decoder Networks0
CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization0
ElasticPlay: Interactive Video Summarization with Dynamic Time Budgets0
Show and Recall: Learning What Makes Videos Memorable0
Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach0
Query-Aware Sparse Coding for Multi-Video Summarization0
Enhancing Video Summarization via Vision-Language Embedding0
Online Summarization via Submodular and Convex Optimization0
Unsupervised Video Summarization With Adversarial LSTM NetworksCode0
Diversity-aware Multi-Video Summarization0
Multi-View Surveillance Video Summarization via Joint Embedding and Sparse Optimization0
Query-adaptive Video Summarization via Quality-aware Relevance EstimationCode0
A Unified Multi-Faceted Video Summarization System0
Temporal Tessellation: A Unified Approach for Video AnalysisCode0
Diversity Promoting Online Sampling for Streaming Video Summarization0
Video Summarization using Deep Semantic FeaturesCode0
Semantic Video Trailers0
Video Summarization in a Multi-View Camera Network0
Query-Focused Extractive Video Summarization0
Recognizing Micro-Actions and Reactions From Paired Egocentric Videos0
Highlight Detection With Pairwise Deep Ranking for First-Person Video Summarization0
A Paradigm for Building Generalized Models of Human Image Perception Through Data Fusion0
Show:102550
← PrevPage 5 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PGL-SUMF1-score (Canonical)55.6Unverified
2RR-STGF1-score (Canonical)54.5Unverified
3DSNetF1-score (Canonical)53Unverified
4VASNetF1-score (Canonical)49.71Unverified
5M-AVSF1-score (Canonical)44.4Unverified
6CSTAKendall's Tau0.25Unverified
#ModelMetricClaimedVerifiedStatus
1RR-STGF1-score (Canonical)63Unverified
2DSNetF1-score (Canonical)62.1Unverified
3VASNetF1-score (Canonical)61.42Unverified
4M-AVSF1-score (Canonical)61Unverified
5PGL-SUMF1-score (Canonical)61Unverified
6CSTAKendall's Tau0.19Unverified
#ModelMetricClaimedVerifiedStatus
1Shotluck-Holmes (3.1B)CIDEr152.3Unverified
2Shotluck-Holmes (3.1B)CIDEr63.2Unverified
3SUM-shotCIDEr8.6Unverified
#ModelMetricClaimedVerifiedStatus
1EgoVLPv2F1 (avg)52.08Unverified
2EgoVLPF1 (avg)49.72Unverified
#ModelMetricClaimedVerifiedStatus
1PGL-SUMMAP (50%)61.6Unverified
#ModelMetricClaimedVerifiedStatus
1VTSUM-BLIP1 shot Micro-F123.5Unverified