Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation

2017-08-01 · WS 2017

Marine Carpuat, Yogarshi Vyas, Xing Niu

Abstract

Parallel corpora are often not as parallel as one might assume: non-literal translations and noisy translations abound, even in curated corpora routinely used for training and evaluation. We use a cross-lingual textual entailment system to distinguish sentence pairs that are parallel in meaning from those that are not, and show that filtering out divergent examples from training improves translation quality.
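
The filtering step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `filter_divergent`, `length_ratio_score`, and the threshold of 0.5 are hypothetical names and values, and the toy length-ratio scorer only stands in for the cross-lingual textual entailment system, which is not reproduced here.

```python
from typing import Callable, Iterable, List, Tuple

SentencePair = Tuple[str, str]  # (source sentence, target sentence)


def filter_divergent(
    pairs: Iterable[SentencePair],
    score_equivalence: Callable[[str, str], float],
    threshold: float = 0.5,  # hypothetical cutoff, not taken from the paper
) -> List[SentencePair]:
    """Keep only the pairs whose equivalence score reaches the threshold.

    `score_equivalence` is assumed to return a value in [0, 1], where higher
    means the two sentences are more likely to be parallel in meaning.
    """
    kept = []
    for src, tgt in pairs:
        if score_equivalence(src, tgt) >= threshold:
            kept.append((src, tgt))
    return kept


def length_ratio_score(src: str, tgt: str) -> float:
    """Toy placeholder scorer: ratio of the shorter to the longer token count.

    This is NOT a semantic divergence detector; it only makes the sketch
    runnable without a trained model.
    """
    n_src, n_tgt = len(src.split()), len(tgt.split())
    longest = max(n_src, n_tgt)
    if longest == 0:
        return 0.0
    return min(n_src, n_tgt) / longest


if __name__ == "__main__":
    corpus = [
        ("the cat sleeps", "le chat dort"),
        ("hello", "bonjour à tous mes amis ici"),
    ]
    for src, tgt in filter_divergent(corpus, length_ratio_score):
        print(src, "|||", tgt)
```

In a real pipeline, the placeholder scorer would be replaced by a trained classifier for semantic equivalence, and the filtered pairs would be used as the translation model's training data.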
