SOTAVerified

EvalD Reference-Less Discourse Evaluation for WMT18

2018-10-01WS 2018Unverified0· sign in to hype

Ond{\v{r}}ej Bojar, Ji{\v{r}}{\'\i} M{\'\i}rovsk{\'y}, Kate{\v{r}}ina Rysov{\'a}, Magdal{\'e}na Rysov{\'a}

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We present the results of automatic evaluation of discourse in machine translation (MT) outputs using the EVALD tool. EVALD was originally designed and trained to assess the quality of human writing, for native speakers and foreign-language learners. MT has seen a tremendous leap in translation quality at the level of sentences and it is thus interesting to see if the human-level evaluation is becoming relevant.

Tasks

Reproductions