Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

2020-06-23Unverified0· sign in to hype

Tosho Hirasawa, Zhishen Yang, Mamoru Komachi, Naoaki Okazaki

Unverified — Be the first to reproduce this paper.

Abstract

Video-guided machine translation as one of multimodal neural machine translation tasks targeting on generating high-quality text translation by tangibly engaging both video and text. In this work, we presented our video-guided machine translation system in approaching the Video-guided Machine Translation Challenge 2020. This system employs keyframe-based video feature extractions along with the video feature positional encoding. In the evaluation phase, our system scored 36.60 corpus-level BLEU-4 and achieved the 1st place on the Video-guided Machine Translation Challenge 2020.

Tasks

Machine Translation Translation Video-Guided Machine Translation

Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

Abstract

Tasks

Reproductions