SOTAVerified

NRC-CNRC Machine Translation Systems for the 2021 AmericasNLP Shared Task

2021-06-01NAACL (AmericasNLP) 2021Unverified0· sign in to hype

Rebecca Knowles, Darlene Stewart, Samuel Larkin, Patrick Littell

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We describe the NRC-CNRC systems submitted to the AmericasNLP shared task on machine translation. We submitted systems translating from Spanish into Wixárika, Nahuatl, Rarámuri, and Guaraní. Our best neural machine translation systems used multilingual pretraining, ensembling, finetuning, training on parts of the development data, and subword regularization. We also submitted translation memory systems as a strong baseline.

Tasks

Reproductions