SOTAVerified

A Finite-State Morphological Analyser for Sindhi

2016-05-01LREC 2016Unverified0· sign in to hype

Raveesh Motlani, Francis Tyers, Dipti Sharma

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Morphological analysis is a fundamental task in natural-language processing, which is used in other NLP applications such as part-of-speech tagging, syntactic parsing, information retrieval, machine translation, etc. In this paper, we present our work on the development of free/open-source finite-state morphological analyser for Sindhi. We have used Apertium's lttoolbox as our finite-state toolkit to implement the transducer. The system is developed using a paradigm-based approach, wherein a paradigm defines all the word forms and their morphological features for a given stem (lemma). We have evaluated our system on the Sindhi Wikipedia corpus and achieved a reasonable coverage of 81\% and a precision of over 97\%.

Tasks

Reproductions