Semantic-Aware Pretraining for Dense Video Captioning

2022-04-13Unverified0· sign in to hype

Teng Wang, Zhu Liu, Feng Zheng, Zhichao Lu, Ran Cheng, Ping Luo

Unverified — Be the first to reproduce this paper.

Abstract

This report describes the details of our approach for the event dense-captioning task in ActivityNet Challenge 2021. We present a semantic-aware pretraining method for dense video captioning, which empowers the learned features to recognize high-level semantic concepts. Diverse video features of different modalities are fed into an event captioning module to generate accurate and meaningful sentences. Our final ensemble model achieves a 10.00 METEOR score on the test set.

Tasks

Dense Captioning Dense Video Captioning Video Captioning

Semantic-Aware Pretraining for Dense Video Captioning

Abstract

Tasks

Reproductions