SOTAVerified

Towards cross-lingual application of language-specific PoS tagging schemes

2021-05-01NoDaLiDa 2021Unverified0· sign in to hype

Hinrik Hafsteinsson, Anton Karl Ingason

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We describe the process of conversion between the PoS tagging schemes of two languages, the Icelandic MIM-GOLD tagging scheme and the Faroese Sosialurin tagging scheme. These tagging schemes are functionally similar but use separate ways to encode fine-grained morphological information on tokenised text. As Faroese and Icelandic are lexically and grammatically similar, having a systematic method to convert between these two tagging schemes would be beneficial in the field of language technology, specifically in research on transfer learning between the two languages. As a product of our work, we present a provisional version of Icelandic corpora, prepared in the Faroese PoS tagging scheme, ready for use in cross-lingual NLP applications.

Tasks

Reproductions