Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation

2021-10-12AMTA 2022Code Available0· sign in to hype

Weiting Tan, Shuoyang Ding, Huda Khayrallah, Philipp Koehn

Code Available — Be the first to reproduce this paper.

Code

github.com/steventan0110/NMTModelAttack
Officialpytorch★ 3

Abstract

Neural Machine Translation (NMT) models are known to suffer from noisy inputs. To make models robust, we generate adversarial augmentation samples that attack the model and preserve the source-side semantic meaning at the same time. To generate such samples, we propose a doubly-trained architecture that pairs two NMT models of opposite translation directions with a joint loss function, which combines the target-side attack and the source-side semantic similarity constraint. The results from our experiments across three different language pairs and two evaluation metrics show that these adversarial samples improve the model robustness.

Tasks

Data Augmentation Machine Translation NMT Semantic Similarity Semantic Textual Similarity Translation

Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation

Code

Abstract

Tasks

Reproductions