UralicNLP: An NLP Library for Uralic Languages

2019-05-09 · Journal of Open Source Software 2019 · Code Available

Mika Hämäläinen

Abstract

UralicNLP is a natural language processing library for small Uralic languages. It can produce morphological analyses, generate morphological forms, lemmatize words, and give lexical information about words in Uralic languages. At the time of writing, the following languages are supported: Skolt Sami, Ingrian, Meadow & Eastern Mari, Votic, Olonets-Karelian, Erzya, Moksha, Hill Mari, Udmurt, Tundra Nenets, Komi-Permyak and Finnish. This information originates from finite-state transducer (FST) tools and dictionaries developed in the Giellatekno infrastructure. Currently, UralicNLP uses the nightly builds for languages supported by Apertium, and less frequently updated FSTs and constraint grammars (CGs) for the other languages.
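As an illustration of the operations the abstract lists, the following is a minimal sketch in Python. The calls `uralicApi.download`, `uralicApi.analyze`, `uralicApi.lemmatize` and `uralicApi.generate` belong to the library's Python package; the helpers `build_generation_query`, `lemmas_from_analyses` and `demo` are hypothetical names introduced here. The sketch assumes that each analysis is a pair of a reading string of the form `lemma+Tag+Tag` and a weight, and it does not handle compound readings.

```python
from typing import Iterable, List, Sequence, Tuple


def build_generation_query(lemma: str, tags: Sequence[str]) -> str:
    """Join a lemma and its morphological tags into one query string."""
    return "+".join([lemma, *tags])


def lemmas_from_analyses(analyses: Iterable[Tuple[str, float]]) -> List[str]:
    """Collect distinct lemmas in order of first appearance.

    Assumption: each reading looks like 'lemma+Tag+Tag'; compound
    readings are not split correctly by this simple rule.
    """
    seen: List[str] = []
    for reading, _weight in analyses:
        lemma = reading.split("+", 1)[0]
        if lemma not in seen:
            seen.append(lemma)
    return seen


def demo(word: str = "voita", lang: str = "fin") -> None:
    """Analyze, lemmatize and generate; needs uralicNLP and network access."""
    # Imported lazily so the helpers above work without the library installed.
    from uralicNLP import uralicApi  # pip install uralicNLP

    uralicApi.download(lang)  # fetch the models for this language code
    print(uralicApi.analyze(word, lang))
    print(uralicApi.lemmatize(word, lang))
    query = build_generation_query("käsi", ["N", "Sg", "Par"])
    print(uralicApi.generate(query, lang))
```

Calling `demo()` downloads the Finnish models on first use and prints the raw results. The two helpers are pure functions, so they can be tried without installing anything.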
