SOTAVerified

An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages

2017-04-01EACL 2017Unverified0· sign in to hype

Georg Heigold, Guenter Neumann, Josef van Genabith

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper investigates neural character-based morphological tagging for languages with complex morphology and large tag sets. Character-based approaches are attractive as they can handle rarely- and unseen words gracefully. We evaluate on 14 languages and observe consistent gains over a state-of-the-art morphological tagger across all languages except for English and French, where we match the state-of-the-art. We compare two architectures for computing character-based word vectors using recurrent (RNN) and convolutional (CNN) nets. We show that the CNN based approach performs slightly worse and less consistently than the RNN based approach. Small but systematic gains are observed when combining the two architectures by ensembling.

Tasks

Reproductions