Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation

2020-06-09ACL 2020Code Available1· sign in to hype

Qiu Ran, Yankai Lin, Peng Li, Jie zhou

Code Available — Be the first to reproduce this paper.

Code

github.com/ranqiu92/RecoverSAT
OfficialIn paperpytorch★ 18

Abstract

Non-autoregressive neural machine translation (NAT) predicts the entire target sequence simultaneously and significantly accelerates inference process. However, NAT discards the dependency information in a sentence, and thus inevitably suffers from the multi-modality problem: the target tokens may be provided by different possible translations, often causing token repetitions or missing. To alleviate this problem, we propose a novel semi-autoregressive model RecoverSAT in this work, which generates a translation as a sequence of segments. The segments are generated simultaneously while each segment is predicted token-by-token. By dynamically determining segment length and deleting repetitive segments, RecoverSAT is capable of recovering from repetitive and missing token errors. Experimental results on three widely-used benchmark datasets show that our proposed model achieves more than 4 speedup while maintaining comparable performance compared with the corresponding autoregressive model.

Tasks

Machine Translation Sentence Translation

Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation

Code

Abstract

Tasks

Reproductions