Appraise Evaluation Framework for Machine Translation
2018-08-01COLING 2018Unverified0· sign in to hype
Christian Federmann
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We present Appraise, an open-source framework for crowd-based annotation tasks, notably for evaluation of machine translation output. This is the software used to run the yearly evaluation campaigns for shared tasks at the WMT Conference on Machine Translation. It has also been used at IWSLT 2017 and, recently, to measure human parity for machine translation for Chinese to English news text. The demo will present the full end-to-end lifecycle of an Appraise evaluation campaign, from task creation to annotation and interpretation of results.