Video Captioning
Video captioning is the task of automatically generating a caption for a video by understanding the actions and events it contains, which in turn allows the video to be retrieved efficiently through text.
Source: NITS-VC System for VATEX Video Captioning Challenge 2020
Papers
Showing 1–10 of 473 papers
Datasets: MSR-VTT, MSVD, YouCook2, VATEX, ActivityNet Captions, MSRVTT-CTN, MSVD-CTN, Hindi MSR-VTT, TVC, ChinaOpen-1k, MSVD-Indonesian, Shot2Story20K
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MaMMUT | CIDEr | 195.6 | — | Unverified |
| 2 | VLAB | CIDEr | 179.8 | — | Unverified |
| 3 | VALOR | CIDEr | 178.5 | — | Unverified |
| 4 | COSA | CIDEr | 178.5 | — | Unverified |
| 5 | mPLUG-2 | CIDEr | 165.8 | — | Unverified |
| 6 | HowToCaption | CIDEr | 154.2 | — | Unverified |
| 7 | HiTeA | CIDEr | 146.9 | — | Unverified |
| 8 | Vid2Seq | CIDEr | 146.2 | — | Unverified |
| 9 | VIOLETv2 | CIDEr | 139.2 | — | Unverified |
| 10 | RTQ | CIDEr | 123.4 | — | Unverified |