Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation

2018-06-01 · NAACL 2018

Marianna Apidianaki, Guillaume Wisniewski, Anne Cocos, Chris Callison-Burch

Abstract

We propose a variant of a well-known machine translation (MT) evaluation metric, HyTER (Dreyer and Marcu, 2012), which exploits reference translations enriched with meaning-equivalent expressions. The original HyTER metric relied on hand-crafted paraphrase networks, which restricted its applicability to new data. We test, for the first time, HyTER with automatically built paraphrase lattices. We show that although the metric obtains good results on small and carefully curated data with both manually and automatically selected substitutes, it achieves medium performance on much larger and noisier datasets, demonstrating the limits of the metric for tuning and evaluation of current MT systems.
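
To make the idea of scoring against a paraphrase lattice concrete, the following is a minimal sketch and not the authors' implementation. It assumes that a HyTER-style score is the smallest word-level edit distance between the hypothesis and any reference path in the lattice, normalized by the length of that path. The lattice is simplified to a hypothetical list of "slots", each holding non-empty alternative phrases. The names `edit_distance`, `lattice_paths`, and `hyter_like_score` are illustrative only.

```python
from itertools import product


def edit_distance(hyp, ref):
    """Word-level Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(ref) + 1))
    for i, h in enumerate(hyp, start=1):
        curr = [i]
        for j, r in enumerate(ref, start=1):
            cost = 0 if h == r else 1
            curr.append(min(prev[j] + 1,          # delete a hypothesis word
                            curr[j - 1] + 1,      # insert a reference word
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def lattice_paths(slots):
    """Expand a toy slot-based lattice into token lists, one per path.

    Assumption: every slot offers at least one non-empty phrase.
    """
    for choice in product(*slots):
        yield [tok for phrase in choice for tok in phrase.split()]


def hyter_like_score(hypothesis, slots):
    """Minimum length-normalized edit distance over all lattice paths."""
    hyp = hypothesis.split()
    return min(edit_distance(hyp, ref) / len(ref)
               for ref in lattice_paths(slots))


# Hypothetical lattice: each slot lists interchangeable paraphrases.
slots = [["the"], ["film", "movie"], ["was"], ["very good", "excellent"]]
score = hyter_like_score("the movie was great", slots)
print(score)
```

Lower scores indicate a hypothesis closer to some reference in the lattice. Enumerating every path is exponential in the number of slots, so this sketch only suits toy inputs; a realistic lattice would need a search over the automaton itself.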

Tasks

Machine Translation, Translation
