SOTAVerified

Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation

2018-06-01NAACL 2018Unverified0· sign in to hype

Marianna Apidianaki, Guillaume Wisniewski, Anne Cocos, Chris Callison-Burch

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We propose a variant of a well-known machine translation (MT) evaluation metric, HyTER (Dreyer and Marcu, 2012), which exploits reference translations enriched with meaning equivalent expressions. The original HyTER metric relied on hand-crafted paraphrase networks which restricted its applicability to new data. We test, for the first time, HyTER with automatically built paraphrase lattices. We show that although the metric obtains good results on small and carefully curated data with both manually and automatically selected substitutes, it achieves medium performance on much larger and noisier datasets, demonstrating the limits of the metric for tuning and evaluation of current MT systems.

Tasks

Reproductions