On the Evaluation of Machine Translation for Terminology Consistency

2021-06-22

Md Mahfuz ibn Alam, Antonios Anastasopoulos, Laurent Besacier, James Cross, Matthias Gallé, Philipp Koehn, Vassilina Nikoulina

Code

github.com/mahfuzibnalam/terminology_evaluation
Official, in paper, ★ 21

Abstract

As neural machine translation (NMT) systems become an important part of professional translator pipelines, a growing body of work focuses on combining NMT with terminologies. In many scenarios, and particularly in cases of domain adaptation, one expects the MT output to adhere to the constraints provided by a terminology. In this work, we propose metrics to measure the consistency of MT output with regard to a domain terminology. We perform studies on the COVID-19 domain over five languages, also performing terminology-targeted human evaluation. We open-source the code for computing all proposed metrics: https://github.com/mahfuzibnalam/terminology_evaluation
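
To make the idea of terminology adherence concrete, here is a minimal sketch of one simple way such a score could be computed. It is an illustration only, not the metrics proposed in the paper or the code in the linked repository: the function name `term_adherence`, the dictionary format of the terminology, and the case-insensitive substring matching are all assumptions made for this example.

```python
from typing import Dict, List


def term_adherence(sources: List[str], hypotheses: List[str],
                   terminology: Dict[str, str]) -> float:
    """Hypothetical score: share of (sentence, term) pairs where the source
    contains a terminology entry and the MT output contains its required
    target-side translation. Matching is plain case-insensitive substring
    search, which is a simplifying assumption (no tokenization or lemmas)."""
    required = 0  # source-side term hits that impose a constraint
    matched = 0   # constraints satisfied by the hypothesis
    for src, hyp in zip(sources, hypotheses):
        src_l, hyp_l = src.lower(), hyp.lower()
        for src_term, tgt_term in terminology.items():
            if src_term.lower() in src_l:
                required += 1
                if tgt_term.lower() in hyp_l:
                    matched += 1
    # With no applicable constraints, report 0.0 rather than dividing by zero.
    return matched / required if required else 0.0


# Toy English-to-French data, invented for illustration.
sources = ["The vaccine reduces transmission.", "Wear a mask."]
hypotheses = ["Le vaccin réduit la transmission.", "Portez un couvre-visage."]
terminology = {"vaccine": "vaccin", "mask": "masque",
               "transmission": "transmission"}

score = term_adherence(sources, hypotheses, terminology)
print(f"term adherence: {score:.2f}")
```

In this toy data, two of the three applicable constraints are satisfied: the second hypothesis is a fluent translation but does not use the required term for "mask", which is the kind of mismatch a terminology-targeted evaluation is meant to surface.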

Tasks

Domain Adaptation, Machine Translation, NMT, Translation
