SOTAVerified

Supervised Video Summarization

Supervised video summarization rely on datasets with human-labeled ground-truth annotations (either in the form of video summaries, as in the case of the SumMe dataset, or in the form of frame-level importance scores, as in the case of the TVSum dataset), based on which they try to discover the underlying criterion for video frame/fragment selection and video summarization.

Source: Video Summarization Using Deep Neural Networks: A Survey

Papers

Showing 2128 of 28 papers

TitleStatusHype
Video Summarization with Attention-Based Encoder-Decoder Networks0
Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning0
Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web PriorCode0
CSTA: CNN-based Spatiotemporal Attention for Video SummarizationCode0
Discriminative Feature Learning for Unsupervised Video SummarizationCode0
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness RewardCode0
Test-Time Training with Self-Supervision for Generalization under Distribution ShiftsCode0
CLIP-It! Language-Guided Video SummarizationCode0
Show:102550
← PrevPage 3 of 3Next →

No leaderboard results yet.