Video Summarization
Video Summarization aims to generate a short synopsis that summarizes the video content by selecting its most informative and important parts. The produced summary is usually composed of a set of representative video frames (a.k.a. video key-frames), or video fragments (a.k.a. video key-fragments) that have been stitched in chronological order to form a shorter video. The former type of a video summary is known as video storyboard, and the latter type is known as video skim.
Source: Video Summarization Using Deep Neural Networks: A Survey Image credit: iJRASET
Papers
Showing 61–70 of 280 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Shotluck-Holmes (3.1B) | CIDEr | 152.3 | — | Unverified |
| 2 | Shotluck-Holmes (3.1B) | CIDEr | 63.2 | — | Unverified |
| 3 | SUM-shot | CIDEr | 8.6 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PGL-SUM | MAP (50%) | 61.6 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VTSUM-BLIP | 1 shot Micro-F1 | 23.5 | — | Unverified |