Video Captioning
Video Captioning is a task of automatic captioning a video by understanding the action and event in the video which can help in the retrieval of the video efficiently through text.
Source: NITS-VC System for VATEX Video Captioning Challenge 2020
Papers
Showing 1–10 of 473 papers
All datasetsMSR-VTTMSVDYouCook2VATEXActivityNet CaptionsMSRVTT-CTNMSVD-CTNHindi MSR-VTTTVCChinaOpen-1kMSVD-IndonesianShot2Story20K
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VAST | BLEU-4 | 18.2 | — | Unverified |
| 2 | UniVL + MELTR | BLEU-4 | 17.92 | — | Unverified |
| 3 | UniVL | BLEU-4 | 17.35 | — | Unverified |
| 4 | VideoCoCa | BLEU-4 | 14.2 | — | Unverified |
| 5 | VLM | BLEU-4 | 12.27 | — | Unverified |
| 6 | E2vidD6-MASSvid-BiD | BLEU-4 | 12.04 | — | Unverified |
| 7 | TextKG | BLEU-4 | 11.7 | — | Unverified |
| 8 | COOT | BLEU-4 | 11.3 | — | Unverified |
| 9 | COSA | BLEU-4 | 10.1 | — | Unverified |
| 10 | HowToCaption | BLEU-4 | 8.8 | — | Unverified |