SOTAVerified

Supervised Video Summarization

Supervised video summarization rely on datasets with human-labeled ground-truth annotations (either in the form of video summaries, as in the case of the SumMe dataset, or in the form of frame-level importance scores, as in the case of the TVSum dataset), based on which they try to discover the underlying criterion for video frame/fragment selection and video summarization.

Source: Video Summarization Using Deep Neural Networks: A Survey

Papers

Showing 1120 of 28 papers

TitleStatusHype
A Stacking Ensemble Approach for Supervised Video Summarization0
Hierarchical Multimodal Transformer to Summarize Videos0
Use of Affective Visual Information for Summarization of Human-Centric Videos0
CLIP-It! Language-Guided Video SummarizationCode0
Self-Attention Recurrent Summarization Network with Reinforcement Learning for Video Summarization TaskCode1
Supervised Video Summarization via Multiple Feature Sets with Parallel AttentionCode1
How Good is a Video Summary? A New Benchmarking Dataset and Evaluation Framework Towards Realistic Video Summarization0
DSNet: A Flexible Detect-to-Summarize Network for Video SummarizationCode1
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization0
Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.