
Improving Back-Translation with Uncertainty-based Confidence Estimation

2019-08-31 · IJCNLP 2019 · Code Available

Shuo Wang, Yang Liu, Chao Wang, Huanbo Luan, Maosong Sun

Abstract

While back-translation is simple and effective in exploiting abundant monolingual corpora to improve low-resource neural machine translation (NMT), the synthetic bilingual corpora generated by NMT models trained on limited authentic bilingual data are inevitably noisy. In this work, we propose to quantify the confidence of NMT model predictions based on model uncertainty. With word- and sentence-level confidence measures based on uncertainty, it is possible for back-translation to better cope with noise in synthetic bilingual corpora. Experiments on Chinese-English and English-German translation tasks show that uncertainty-based confidence estimation significantly improves the performance of back-translation.
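The abstract does not say how model uncertainty is measured or how the confidence scores are used in training. As a minimal sketch, assume the back-translation model is run several times with dropout left on (Monte Carlo dropout style), so each word of a synthetic sentence gets several probability estimates. The function names, the use of the mean as confidence and the variance as uncertainty, and the geometric-mean sentence score are all illustrative assumptions, not the authors' formulation.

```python
import math
from statistics import fmean, pvariance


def word_confidence(samples):
    """Aggregate per-word probabilities over K stochastic forward passes.

    samples: list of K passes; each pass is a list with one probability
    per word of the same synthetic sentence (hypothetical input format).
    Returns (means, variances), one entry per word.
    """
    per_word = list(zip(*samples))  # regroup: one tuple of K values per word
    means = [fmean(p) for p in per_word]          # assumed word-level confidence
    variances = [pvariance(p) for p in per_word]  # assumed word-level uncertainty
    return means, variances


def sentence_confidence(word_means, eps=1e-12):
    """Assumed sentence-level score: geometric mean of word confidences."""
    logs = [math.log(max(m, eps)) for m in word_means]
    return math.exp(fmean(logs))


# Three hypothetical passes over a two-word synthetic sentence.
samples = [
    [0.90, 0.20],
    [0.80, 0.40],
    [0.85, 0.30],
]
means, variances = word_confidence(samples)
score = sentence_confidence(means)
```

Under these assumptions, a low `score` would mark a synthetic sentence pair as likely noisy, and it could, for example, serve as a weight on that pair's training loss. The second word here has both a lower mean and a higher variance than the first, so it would be treated as the less reliable of the two.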
