Video Captioning
Video Captioning is a task of automatic captioning a video by understanding the action and event in the video which can help in the retrieval of the video efficiently through text.
Source: NITS-VC System for VATEX Video Captioning Challenge 2020
Papers
Showing 1–10 of 473 papers
All datasetsMSR-VTTMSVDYouCook2VATEXActivityNet CaptionsMSRVTT-CTNMSVD-CTNHindi MSR-VTTTVCChinaOpen-1kMSVD-IndonesianShot2Story20K
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | mPLUG-2 | CIDEr | 80 | — | Unverified |
| 2 | VAST | CIDEr | 78 | — | Unverified |
| 3 | GIT2 | CIDEr | 75.9 | — | Unverified |
| 4 | VLAB | CIDEr | 74.9 | — | Unverified |
| 5 | COSA | CIDEr | 74.7 | — | Unverified |
| 6 | VALOR | CIDEr | 74 | — | Unverified |
| 7 | MaMMUT (ours) | CIDEr | 73.6 | — | Unverified |
| 8 | VideoCoCa | CIDEr | 73.2 | — | Unverified |
| 9 | RTQ | CIDEr | 69.3 | — | Unverified |
| 10 | HowToCaption | CIDEr | 65.3 | — | Unverified |