Supervised Video Summarization
Supervised video summarization rely on datasets with human-labeled ground-truth annotations (either in the form of video summaries, as in the case of the SumMe dataset, or in the form of frame-level importance scores, as in the case of the TVSum dataset), based on which they try to discover the underlying criterion for video frame/fragment selection and video summarization.
Source: Video Summarization Using Deep Neural Networks: A Survey
Papers
Showing 21–28 of 28 papers
No leaderboard results yet.