SOTAVerified

Neural Transition-based String Transduction for Limited-Resource Setting in Morphology

2018-08-01COLING 2018Code Available0· sign in to hype

Peter Makarov, Simon Clematide

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We present a neural transition-based model that uses a simple set of edit actions (copy, delete, insert) for morphological transduction tasks such as inflection generation, lemmatization, and reinflection. In a large-scale evaluation on four datasets and dozens of languages, our approach consistently outperforms state-of-the-art systems on low and medium training-set sizes and is competitive in the high-resource setting. Learning to apply a generic copy action enables our approach to generalize quickly from a few data points. We successfully leverage minimum risk training to compensate for the weaknesses of MLE parameter learning and neutralize the negative effects of training a pipeline with a separate character aligner.

Tasks

Reproductions