SOTAVerified

Video Summarization

Video Summarization aims to generate a short synopsis that summarizes the video content by selecting its most informative and important parts. The produced summary is usually composed of a set of representative video frames (a.k.a. video key-frames), or video fragments (a.k.a. video key-fragments) that have been stitched in chronological order to form a shorter video. The former type of a video summary is known as video storyboard, and the latter type is known as video skim.

Source: Video Summarization Using Deep Neural Networks: A Survey Image credit: iJRASET

Papers

Showing 101150 of 280 papers

TitleStatusHype
A Framework towards Domain Specific Video Summarization0
A General Framework for Edited Video and Raw Video Summarization0
Agent-based Video Trimming0
A Graph-based Ranking Approach to Extract Key-frames for Static Video Summarization0
A Memory Network Approach for Story-based Temporal Summarization of 360° Videos0
A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos0
A Multi-stage deep architecture for summary generation of soccer videos0
An Attention-Based Speaker Naming Method for Online Adaptation in Non-Fixed Scenarios0
An Enhanced Method For Evaluating Automatic Video Summaries0
A New Action Recognition Framework for Video Highlights Summarization in Sporting Events0
Submodular Maximization in Clean Linear Time0
A Novel Approach for Robust Multi Human Action Recognition and Summarization based on 3D Convolutional Neural Networks0
A Novel Trustworthy Video Summarization Algorithm Through a Mixture of LoRA Experts0
A Paradigm for Building Generalized Models of Human Image Perception Through Data Fusion0
A Survey on Patch-based Synthesis: GPU Implementation and Optimization0
A Survey on Recent Advances of Computer Vision Algorithms for Egocentric Video0
Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers0
Attentive monitoring of multiple video streams driven by a Bayesian foraging strategy0
Audiovisual Highlight Detection in Videos0
AudioVisual Video Summarization0
A Unified Multi-Faceted Video Summarization System0
Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention0
A Stacking Ensemble Approach for Supervised Video Summarization0
Beyond the Frame: Single and mutilple video summarization method with user-defined length0
Causalainer: Causal Explainer for Automatic Video Summarization0
Causal Video Summarizer for Video Exploration0
CFSum: A Transformer-Based Multi-Modal Video Summarization Framework With Coarse-Fine Fusion0
CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization0
Common Action Discovery and Localization in Unconstrained Videos0
Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning0
Comprehensive Video Understanding: Video summarization with content-based video recommender design0
Conditional Modeling Based Automatic Video Summarization0
Co-Regularized Deep Representations for Video Summarization0
Creating Summaries from User Videos0
Cycle-SUM: Cycle-consistent Adversarial LSTM Networks for Unsupervised Video Summarization0
DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization0
Demystifying Multi-Faceted Video Summarization: Tradeoff Between Diversity,Representation, Coverage and Importance0
Detecting Engagement in Egocentric Video0
Discovery of Shared Semantic Spaces for Multi-Scene Video Query and Summarization0
Diverse and Coherent Paragraph Generation from Images0
Diverse Sequential Subset Selection for Supervised Video Summarization0
Show and Recall: Learning What Makes Videos Memorable0
Distance Metric-Based Learning with Interpolated Latent Features for Location Classification in Endoscopy Image and Video0
Story-Driven Summarization for Egocentric Video0
Stream Clipper: Scalable Submodular Maximization on Stream0
Subset Selection and Summarization in Sequential Data0
SumGraph: Video Summarization via Recursive Graph Modeling0
Summarization of User-Generated Sports Video by Using Deep Action Recognition Features0
Summary Transfer: Exemplar-based Subset Selection for Video Summarization0
SUSiNet: See, Understand and Summarize it0
Show:102550
← PrevPage 3 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PGL-SUMF1-score (Canonical)55.6Unverified
2RR-STGF1-score (Canonical)54.5Unverified
3DSNetF1-score (Canonical)53Unverified
4VASNetF1-score (Canonical)49.71Unverified
5M-AVSF1-score (Canonical)44.4Unverified
6CSTAKendall's Tau0.25Unverified
#ModelMetricClaimedVerifiedStatus
1RR-STGF1-score (Canonical)63Unverified
2DSNetF1-score (Canonical)62.1Unverified
3VASNetF1-score (Canonical)61.42Unverified
4PGL-SUMF1-score (Canonical)61Unverified
5M-AVSF1-score (Canonical)61Unverified
6CSTAKendall's Tau0.19Unverified
#ModelMetricClaimedVerifiedStatus
1Shotluck-Holmes (3.1B)CIDEr152.3Unverified
2Shotluck-Holmes (3.1B)CIDEr63.2Unverified
3SUM-shotCIDEr8.6Unverified
#ModelMetricClaimedVerifiedStatus
1EgoVLPv2F1 (avg)52.08Unverified
2EgoVLPF1 (avg)49.72Unverified
#ModelMetricClaimedVerifiedStatus
1PGL-SUMMAP (50%)61.6Unverified
#ModelMetricClaimedVerifiedStatus
1VTSUM-BLIP1 shot Micro-F123.5Unverified