SOTAVerified

Video Summarization

Video Summarization aims to generate a short synopsis that summarizes the video content by selecting its most informative and important parts. The produced summary is usually composed of a set of representative video frames (a.k.a. video key-frames), or video fragments (a.k.a. video key-fragments) that have been stitched in chronological order to form a shorter video. The former type of a video summary is known as video storyboard, and the latter type is known as video skim.

Source: Video Summarization Using Deep Neural Networks: A Survey Image credit: iJRASET

Papers

Showing 201250 of 280 papers

TitleStatusHype
"Previously on ..." From Recaps to Story Summarization0
Previously on ... From Recaps to Story Summarization0
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization0
Query-Aware Sparse Coding for Multi-Video Summarization0
Query-based Video Summarization with Pseudo Label Supervision0
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning0
Query-Conditioned Three-Player Adversarial Network for Video Summarization0
Query-Focused Extractive Video Summarization0
Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach0
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization0
R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding0
R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding0
Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework0
Realizing Video Summarization from the Path of Language-based Semantic Understanding0
Real-time Video Summarization on Commodity Hardware0
Recognizing Micro-Actions and Reactions From Paired Egocentric Videos0
Reconstructive Sequence-Graph Network for Video Summarization0
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing0
Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications0
Relational Reasoning Over Spatial-Temporal Graphs for Video Summarization0
Representative Selection for Big Data via Sparse Graph and Geodesic Grassmann Manifold Distance0
Retrospective Encoders for Video Summarization0
Role of Audio in Audio-Visual Video Summarization0
Saliency-based Video Summarization for Face Anti-spoofing0
SalSum: Saliency-based Video Summarization using Generative Adversarial Networks0
Scaling Submodular Maximization via Pruned Submodularity Graphs0
Scaling Up Video Summarization Pretraining with Large Language Models0
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames0
Segmentation of Bleeding Regions in Wireless Capsule Endoscopy Images an Approach for inside Capsule Video Summarization0
Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization0
Semantics for Large-Scale Multimedia: New Challenges for NLP0
Semantic Video Trailers0
Sequence-to-Segment Networks for Segment Detection0
Show and Recall: Learning What Makes Videos Memorable0
Distance Metric-Based Learning with Interpolated Latent Features for Location Classification in Endoscopy Image and Video0
Story-Driven Summarization for Egocentric Video0
Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer VisionCode0
Multi-Stream Dynamic Video SummarizationCode0
Does SpatioTemporal information benefit Two video summarization benchmarks?Code0
Your Interest, Your Summaries: Query-Focused Long Video SummarizationCode0
SD-VSum: A Method and Dataset for Script-Driven Video SummarizationCode0
Discriminative Feature Learning for Unsupervised Video SummarizationCode0
Unsupervised multi-latent space reinforcement learning framework for video summarization in ultrasound imagingCode0
ILS-SUMM: Iterated Local Search for Unsupervised Video SummarizationCode0
SELF-VS: Self-supervised Encoding Learning For Video SummarizationCode0
Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web PriorCode0
Video Summarization with Long Short-term MemoryCode0
Unsupervised Video Summarization via Iterative Training and Simplified GANCode0
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video SummarizationCode0
FFNet: Video Fast-Forwarding via Reinforcement LearningCode0
Show:102550
← PrevPage 5 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PGL-SUMF1-score (Canonical)55.6Unverified
2RR-STGF1-score (Canonical)54.5Unverified
3DSNetF1-score (Canonical)53Unverified
4VASNetF1-score (Canonical)49.71Unverified
5M-AVSF1-score (Canonical)44.4Unverified
6CSTAKendall's Tau0.25Unverified
#ModelMetricClaimedVerifiedStatus
1RR-STGF1-score (Canonical)63Unverified
2DSNetF1-score (Canonical)62.1Unverified
3VASNetF1-score (Canonical)61.42Unverified
4PGL-SUMF1-score (Canonical)61Unverified
5M-AVSF1-score (Canonical)61Unverified
6CSTAKendall's Tau0.19Unverified
#ModelMetricClaimedVerifiedStatus
1Shotluck-Holmes (3.1B)CIDEr152.3Unverified
2Shotluck-Holmes (3.1B)CIDEr63.2Unverified
3SUM-shotCIDEr8.6Unverified
#ModelMetricClaimedVerifiedStatus
1EgoVLPv2F1 (avg)52.08Unverified
2EgoVLPF1 (avg)49.72Unverified
#ModelMetricClaimedVerifiedStatus
1PGL-SUMMAP (50%)61.6Unverified
#ModelMetricClaimedVerifiedStatus
1VTSUM-BLIP1 shot Micro-F123.5Unverified