Linguistic Input Features Improve Neural Machine Translation

2016-06-09WS 2016Code Available0· sign in to hype

Rico Sennrich, Barry Haddow

Code Available — Be the first to reproduce this paper.

Code

github.com/rsennrich/wmt16-scripts
OfficialIn papernone★ 0

Abstract

Neural machine translation has recently achieved impressive results, while using little in the way of external linguistic information. In this paper we show that the strong learning capability of neural MT models does not make linguistic features redundant; they can be easily incorporated to provide further improvements in performance. We generalize the embedding layer of the encoder in the attentional encoder--decoder architecture to support the inclusion of arbitrary features, in addition to the baseline word feature. We add morphological features, part-of-speech tags, and syntactic dependency labels as input features to English<->German, and English->Romanian neural machine translation systems. In experiments on WMT16 training and test sets, we find that linguistic input features improve model quality according to three metrics: perplexity, BLEU and CHRF3. An open-source implementation of our neural MT system is available, as are sample files and configurations.

Tasks

Decoder Machine Translation Translation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
WMT2016 English-German	Linguistic Input Features	BLEU score	28.4	—	Unverified
WMT2016 German-English	Linguistic Input Features	BLEU score	32.9	—	Unverified

Linguistic Input Features Improve Neural Machine Translation

Code

Abstract

Tasks

Benchmark Results

Reproductions