
Cross-lingual tagger evaluation without test data

2017-04-01 · EACL 2017

Željko Agić, Barbara Plank, Anders Søgaard


Abstract

We address the challenge of cross-lingual POS tagger evaluation in the absence of manually annotated test data. We propose and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we show that these metrics are reliable enough to approximate test-set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
