Video Captioning
Video captioning is the task of automatically generating a textual description of a video by understanding the actions and events it contains. The resulting captions make it possible to retrieve videos efficiently through text.
Source: NITS-VC System for VATEX Video Captioning Challenge 2020
Datasets: MSR-VTT, MSVD, YouCook2, VATEX, ActivityNet Captions, MSRVTT-CTN, MSVD-CTN, Hindi MSR-VTT, TVC, ChinaOpen-1k, MSVD-Indonesian, Shot2Story20K
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VALOR | BLEU-4 | 45.6 | — | Unverified |
| 2 | VAST | BLEU-4 | 45 | — | Unverified |
| 3 | COSA | BLEU-4 | 43.7 | — | Unverified |
| 4 | VideoCoCa | BLEU-4 | 39.7 | — | Unverified |
| 5 | IcoCap (ViT-B/16) | BLEU-4 | 37.4 | — | Unverified |
| 6 | IcoCap (ViT-B/32) | BLEU-4 | 36.9 | — | Unverified |
| 7 | VASTA (Kinetics-backbone) | BLEU-4 | 36.25 | — | Unverified |
| 8 | CoCap (ViT/L14) | BLEU-4 | 35.8 | — | Unverified |
| 9 | ORG-TRL | BLEU-4 | 32.1 | — | Unverified |
| 10 | NITS-VC | BLEU-4 | 20 | — | Unverified |
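
The table ranks models by BLEU-4, which measures clipped n-gram overlap (n = 1 to 4) between generated and reference captions, combined with a brevity penalty. The following is a minimal sketch of corpus-level BLEU-4 in plain Python, scaled to 0-100 as in the table. The function names `ngrams` and `corpus_bleu4` are illustrative, and whitespace tokenization is an assumption; published scores are typically produced with a standard evaluation toolkit and its own tokenizer, so this sketch is not expected to reproduce the numbers above exactly.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu4(candidates, references):
    """Corpus-level BLEU-4 on a 0-100 scale (unsmoothed sketch).

    candidates: list of token lists, one per video.
    references: list of lists of token lists (several references per video).
    """
    matched = [0] * 4  # clipped n-gram matches, per order
    total = [0] * 4    # candidate n-gram counts, per order
    cand_len = 0
    ref_len = 0
    for cand, refs in zip(candidates, references):
        cand_len += len(cand)
        # Effective reference length: the closest one, ties go to the shorter.
        ref_len += min((abs(len(r) - len(cand)), len(r)) for r in refs)[1]
        for n in range(1, 5):
            cand_counts = ngrams(cand, n)
            # Clip each candidate n-gram by its maximum count in any reference.
            max_ref = Counter()
            for r in refs:
                for gram, count in ngrams(r, n).items():
                    max_ref[gram] = max(max_ref[gram], count)
            matched[n - 1] += sum(min(c, max_ref[g]) for g, c in cand_counts.items())
            total[n - 1] += sum(cand_counts.values())
    if min(matched) == 0:
        return 0.0  # no smoothing: any empty order gives a zero score
    log_precision = sum(math.log(m / t) for m, t in zip(matched, total)) / 4
    brevity = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    return 100 * brevity * math.exp(log_precision)


# Made-up captions for illustration only, not drawn from any dataset.
candidate = "a man is playing guitar".split()
reference = "a man is playing a guitar".split()
score = corpus_bleu4([candidate], [[reference]])
print(round(score, 2))
```

Because the score is accumulated over the whole corpus before the geometric mean is taken, it generally differs from an average of per-sentence BLEU scores, which is one reason numbers from different evaluation scripts are not directly comparable.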