Monolingual Embeddings for Low Resourced Neural Machine Translation

2017-12-01IWSLT 2017Code Available0· sign in to hype

Mattia Antonino Di Gangi, Marcello Federico

Code Available — Be the first to reproduce this paper.

Code

github.com/mattiadg/nmt-external-embeddings
OfficialIn papernone★ 0

Abstract

Neural machine translation (NMT) is the state of the art for machine translation, and it shows the best performance when there is a considerable amount of data available. When only little data exist for a language pair, the model cannot produce good representations for words, particularly for rare words. One common solution consists in reducing data sparsity by segmenting words into sub-words, in order to allow rare words to have shared representations with other words. Taking a different approach, in this paper we present a method to feed an NMT network with word embeddings trained on monolingual data, which are combined with the task-specific embeddings learned at training time. This method can leverage an embedding matrix with a huge number of words, which can therefore extend the word-level vocabulary. Our experiments on two language pairs show good results for the typical low-resourced data scenario (IWSLT in-domain dataset). Our consistent improvements over the baselines represent a positive proof about the possibility to leverage models pre-trained on monolingual data in NMT.

Tasks

Machine Translation NMT Translation Word Embeddings

Monolingual Embeddings for Low Resourced Neural Machine Translation

Code

Abstract

Tasks

Reproductions