Getting More Data for Low-resource Morphological Inflection: Language Models and Data Augmentation

2020-05-01LREC 2020Unverified0· sign in to hype

Alexey Sorokin

Unverified — Be the first to reproduce this paper.

Abstract

We investigate how to improve quality of low-resource morphological inflection without annotating more data. We examine two methods, language models and data augmentation. We show that the model whose decoder that additionally uses the states of the langauge model improves the model quality by 1.5\% in combination with both baselines. We also demonstrate that the augmentation of data improves performance by 9\% in average when adding 1000 artificially generated word forms to the dataset.

Tasks

Data Augmentation Decoder Morphological Inflection

Getting More Data for Low-resource Morphological Inflection: Language Models and Data Augmentation

Abstract

Tasks

Reproductions