SOTAVerified

Video Summarization

Video Summarization aims to generate a short synopsis that summarizes the video content by selecting its most informative and important parts. The produced summary is usually composed of a set of representative video frames (a.k.a. video key-frames), or video fragments (a.k.a. video key-fragments) that have been stitched in chronological order to form a shorter video. The former type of a video summary is known as video storyboard, and the latter type is known as video skim.

Source: Video Summarization Using Deep Neural Networks: A Survey Image credit: iJRASET

Papers

Showing 101150 of 280 papers

TitleStatusHype
Multi-view Metric Learning for Multi-view Video Summarization0
Multi-View Surveillance Video Summarization via Joint Embedding and Sparse Optimization0
NEWSKVQA: Knowledge-Aware News Video Question Answering0
NLP Driven Ensemble Based Automatic Subtitle Generation and Semantic Video Summarization Technique0
Non-Monotone Submodular Maximization with Multiple Knapsacks in Static and Dynamic Settings0
Online Learnable Keyframe Extraction in Videos and its Application with Semantic Word Vector in Action Recognition0
Online Summarization via Submodular and Convex Optimization0
Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing0
Parameter-free Video Segmentation for Vision and Language Understanding0
Pegasus-v1 Technical Report0
Personalized Video Summarization by Multimodal Video Understanding0
Personalized Video Summarization using Text-Based Queries and Conditional Modeling0
Predicting Important Objects for Egocentric Video Summarization0
"Previously on ..." From Recaps to Story Summarization0
Previously on ... From Recaps to Story Summarization0
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization0
Query-Aware Sparse Coding for Multi-Video Summarization0
Query-based Video Summarization with Pseudo Label Supervision0
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning0
Query-Conditioned Three-Player Adversarial Network for Video Summarization0
Query-Focused Extractive Video Summarization0
Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach0
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization0
Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework0
Realizing Video Summarization from the Path of Language-based Semantic Understanding0
Real-time Video Summarization on Commodity Hardware0
Reconstructive Sequence-Graph Network for Video Summarization0
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing0
Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications0
Relational Reasoning Over Spatial-Temporal Graphs for Video Summarization0
Representative Selection for Big Data via Sparse Graph and Geodesic Grassmann Manifold Distance0
Retrospective Encoders for Video Summarization0
Role of Audio in Audio-Visual Video Summarization0
Saliency-based Video Summarization for Face Anti-spoofing0
SalSum: Saliency-based Video Summarization using Generative Adversarial Networks0
Scaling Submodular Maximization via Pruned Submodularity Graphs0
Scaling Up Video Summarization Pretraining with Large Language Models0
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames0
Segmentation of Bleeding Regions in Wireless Capsule Endoscopy Images an Approach for inside Capsule Video Summarization0
Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization0
Semantics for Large-Scale Multimedia: New Challenges for NLP0
Semantic Video Trailers0
Sequence-to-Segment Networks for Segment Detection0
Show and Recall: Learning What Makes Videos Memorable0
Distance Metric-Based Learning with Interpolated Latent Features for Location Classification in Endoscopy Image and Video0
Story-Driven Summarization for Egocentric Video0
Stream Clipper: Scalable Submodular Maximization on Stream0
Subset Selection and Summarization in Sequential Data0
SumGraph: Video Summarization via Recursive Graph Modeling0
Summarization of User-Generated Sports Video by Using Deep Action Recognition Features0
Show:102550
← PrevPage 3 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PGL-SUMF1-score (Canonical)55.6Unverified
2RR-STGF1-score (Canonical)54.5Unverified
3DSNetF1-score (Canonical)53Unverified
4VASNetF1-score (Canonical)49.71Unverified
5M-AVSF1-score (Canonical)44.4Unverified
6CSTAKendall's Tau0.25Unverified
#ModelMetricClaimedVerifiedStatus
1RR-STGF1-score (Canonical)63Unverified
2DSNetF1-score (Canonical)62.1Unverified
3VASNetF1-score (Canonical)61.42Unverified
4PGL-SUMF1-score (Canonical)61Unverified
5M-AVSF1-score (Canonical)61Unverified
6CSTAKendall's Tau0.19Unverified
#ModelMetricClaimedVerifiedStatus
1Shotluck-Holmes (3.1B)CIDEr152.3Unverified
2Shotluck-Holmes (3.1B)CIDEr63.2Unverified
3SUM-shotCIDEr8.6Unverified
#ModelMetricClaimedVerifiedStatus
1EgoVLPv2F1 (avg)52.08Unverified
2EgoVLPF1 (avg)49.72Unverified
#ModelMetricClaimedVerifiedStatus
1PGL-SUMMAP (50%)61.6Unverified
#ModelMetricClaimedVerifiedStatus
1VTSUM-BLIP1 shot Micro-F123.5Unverified