Video Captioning
Video Captioning is a task of automatic captioning a video by understanding the action and event in the video which can help in the retrieval of the video efficiently through text.
Source: NITS-VC System for VATEX Video Captioning Challenge 2020
Papers
Showing 1–10 of 473 papers
All datasetsMSR-VTTMSVDYouCook2VATEXActivityNet CaptionsMSRVTT-CTNMSVD-CTNHindi MSR-VTTTVCChinaOpen-1kMSVD-IndonesianShot2Story20K
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VideoCoCa | BLEU4 | 14.7 | — | Unverified |
| 2 | VLTinT (ae-test split) C3D/Ling | BLEU4 | 14.5 | — | Unverified |
| 3 | VLCap (ae-test split) - Appearance + Language | BLEU4 | 13.38 | — | Unverified |
| 4 | COOT (ae-test split) - Only Appearance features | BLEU4 | 10.85 | — | Unverified |
| 5 | MART (ae-test split) - Appearance + Flow | BLEU4 | 10.33 | — | Unverified |