Cross-lingual tagger evaluation without test data
2017-04-01EACL 2017Unverified0· sign in to hype
{\v{Z}}eljko Agi{\'c}, Barbara Plank, Anders S{\o}gaard
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.