Improving Attention Modeling with Implicit Distortion and Fertility for Machine Translation

2016-12-01 · COLING 2016

Shi Feng, Shujie Liu, Nan Yang, Mu Li, Ming Zhou, Kenny Q. Zhu

Abstract

In neural machine translation, the attention mechanism facilitates the translation process by producing a soft alignment between the source sentence and the target sentence. However, without the dedicated distortion and fertility models seen in traditional SMT systems, the learned alignment may not be accurate, which can lead to low translation quality. In this paper, we propose two novel models to improve attention-based neural machine translation. We propose a recurrent attention mechanism as an implicit distortion model, and a fertility conditioned decoder as an implicit fertility model. We conduct experiments on large-scale Chinese-English translation tasks. The results show that our models significantly improve both the alignment and translation quality compared to the original attention mechanism and several other variations.
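
For readers unfamiliar with the term, the "soft alignment" mentioned in the abstract can be illustrated with a minimal sketch of generic softmax attention for a single decoding step. This is not the recurrent attention mechanism or the fertility conditioned decoder proposed in the paper. The dot-product scoring function, the function names `soft_alignment` and `context_vector`, and the toy vectors are all assumptions made for illustration.

```python
import math


def soft_alignment(decoder_state, encoder_states):
    """Return one attention weight per source position for a single target step."""
    # Score each source position against the decoder state.
    # Dot-product scoring is an assumption; the abstract does not specify the scorer.
    scores = [
        sum(d * h for d, h in zip(decoder_state, enc))
        for enc in encoder_states
    ]
    # Numerically stable softmax: subtract the max score before exponentiating.
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def context_vector(weights, encoder_states):
    """Weighted average of the encoder states under the alignment weights."""
    dim = len(encoder_states[0])
    return [
        sum(w * enc[i] for w, enc in zip(weights, encoder_states))
        for i in range(dim)
    ]


# Toy example: three source positions with 2-dimensional hidden states (hypothetical values).
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [2.0, 0.0]

weights = soft_alignment(decoder_state, encoder_states)
context = context_vector(weights, encoder_states)
```

In this sketch the weights form a probability distribution over source positions, and each step is computed independently of the previous ones. Nothing here tracks where attention fell earlier or how often a source word has already been attended to, which is the gap the abstract attributes to the lack of distortion and fertility models.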

Tasks

Decoder, Machine Translation, Sentence, Translation
