
Multilingual Neural Machine Translation: Case-study for Catalan, Spanish and Portuguese Romance Languages

2020-11-01 · WMT (EMNLP) 2020

Pere Vergés Boncompte, Marta R. Costa-jussà


Abstract

In this paper, we describe the TALP-UPC participation in the WMT Similar Language Translation task between Catalan, Spanish, and Portuguese, all of which are Romance languages. We used several techniques to improve translation between these languages. A multilingual shared encoder/decoder was used for all of them. Additionally, we applied back-translation to take advantage of monolingual data. Finally, we applied fine-tuning to improve performance on in-domain data. Each of these techniques brings improvements over the previous one. In the official evaluation, our system was ranked 1st in the Portuguese-to-Spanish direction, 2nd in the opposite direction, and 3rd in the Catalan-Spanish pair.
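
The abstract does not say how the shared encoder/decoder is told which language to produce, nor how the back-translated data is assembled. The following is a minimal sketch of one common way to prepare data for such a system, assuming a target-language token is prefixed to each source sentence. The `<2xx>` tag format, the function names, and the `reverse_model` callable are all hypothetical illustrations, not the authors' implementation.

```python
from typing import Callable, Dict, List, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


def tag_source(sentence: str, target_lang: str) -> str:
    """Prefix a source sentence with a (hypothetical) target-language token."""
    return f"<2{target_lang}> {sentence.strip()}"


def build_multilingual_corpus(corpora: Dict[Tuple[str, str], List[Pair]]) -> List[Pair]:
    """Merge per-direction parallel corpora into one tagged training set.

    `corpora` maps (source_lang, target_lang) to a list of sentence pairs,
    so a single shared model can be trained on every direction at once.
    """
    merged: List[Pair] = []
    for (_, target_lang), pairs in corpora.items():
        for src, tgt in pairs:
            merged.append((tag_source(src, target_lang), tgt))
    return merged


def back_translate(
    mono_target: List[str],
    target_lang: str,
    reverse_model: Callable[[str], str],
) -> List[Pair]:
    """Build synthetic pairs from monolingual target-side text.

    `reverse_model` (assumed) translates target-language text back into the
    source language; the genuine monolingual sentence stays on the target side.
    """
    return [(tag_source(reverse_model(t), target_lang), t) for t in mono_target]


if __name__ == "__main__":
    parallel = {
        ("ca", "es"): [("Bon dia", "Buenos días")],
        ("es", "pt"): [("Hola", "Olá")],
    }
    print(build_multilingual_corpus(parallel))
    # A toy stand-in for a real reverse translation model.
    print(back_translate(["Bom dia"], "pt", lambda s: s.upper()))
```

Under these assumptions, fine-tuning would amount to continuing training of the same model on the in-domain subset of the tagged pairs.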
