SOTAVerified

Learning Curricula for Multilingual Neural Machine Translation Training

2021-08-01MTSummit 2021Unverified0· sign in to hype

Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Low-resource Multilingual Neural Machine Translation (MNMT) is typically tasked with improving the translation performance on one or more language pairs with the aid of high-resource language pairs. In this paper and we propose two simple search based curricula – orderings of the multilingual training data – which help improve translation performance in conjunction with existing techniques such as fine-tuning. Additionally and we attempt to learn a curriculum for MNMT from scratch jointly with the training of the translation system using contextual multi-arm bandits. We show on the FLORES low-resource translation dataset that these learned curricula can provide better starting points for fine tuning and improve overall performance of the translation system.

Tasks

Reproductions