Adaptive Weighting for Neural Machine Translation

2018-08-01 · COLING 2018 · Code Available

Yachao Li, Junhui Li, Min Zhang

Code

github.com/liyc7711/weighted-nmt
Official · In paper

Abstract

In the popular sequence-to-sequence (seq2seq) approach to neural machine translation (NMT), there are many weighted sum models (WSMs), each of which takes a set of inputs and generates one output. However, the weights in a WSM are independent of each other and fixed for all inputs, which suggests that, by ignoring the different needs of its inputs, the WSM lacks effective control over the influence of each input. In this paper, we propose adaptive weighting for WSMs to control the contribution of each input. Specifically, we apply adaptive weighting to both the GRU and the output state in NMT. Experiments on Chinese-to-English and English-to-German translation show that the proposed adaptive weighting substantially improves translation accuracy, with significant gains of 1.49 and 0.92 BLEU points on the two tasks, respectively. Moreover, we discuss in depth what type of information is encoded in the encoder and how that information influences the generation of target words in the decoder.
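
To make the contrast in the abstract concrete, the following is a minimal sketch of a plain weighted sum next to an input-dependent (gated) one. It is not the paper's formulation, which this page does not give. The gate matrices `Gs`, the sigmoid gating over the concatenated inputs, and all function names are assumptions chosen for illustration only.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def weighted_sum(Ws, xs):
    """Plain weighted sum: o = sum_i W_i x_i, with the same W_i for every input."""
    terms = [matvec(W, x) for W, x in zip(Ws, xs)]
    return [sum(col) for col in zip(*terms)]


def adaptive_weighted_sum(Ws, Gs, xs):
    """Hypothetical gated variant: o = sum_i g_i * (W_i x_i).

    Each gate g_i = sigmoid(G_i [x_1; ...; x_n]) depends on all inputs,
    so the contribution of each term changes from input to input.
    (Assumed form for illustration, not taken from the paper.)
    """
    concat = [v for x in xs for v in x]
    out = [0.0] * len(Ws[0])
    for W, G, x in zip(Ws, Gs, xs):
        gate = [sigmoid(z) for z in matvec(G, concat)]
        term = matvec(W, x)
        out = [o + g * t for o, g, t in zip(out, gate, term)]
    return out


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
dim = 3
xs = [[1.0, 0.5, -0.5], [0.2, -0.3, 0.8]]  # two toy input vectors
Ws = [random_matrix(dim, dim, rng) for _ in xs]
Gs = [random_matrix(dim, dim * len(xs), rng) for _ in xs]

plain = weighted_sum(Ws, xs)
gated = adaptive_weighted_sum(Ws, Gs, xs)
```

In this sketch, setting every gate matrix to zero makes each gate equal to 0.5, so the gated output collapses to a uniformly scaled copy of the plain weighted sum. Only non-trivial gates let the balance between inputs shift from example to example.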

Tasks

Decoder · Machine Translation · NMT · Translation
