SOTAVerified

Automatic acquisition of Urdu nouns (along with gender and irregular plurals)

2014-05-01LREC 2014Unverified0· sign in to hype

Tafseer Ahmed Khan

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The paper describes a set of methods to automatically acquire the Urdu nouns (and its gender) on the basis of inflectional and contextual clues. The algorithms used are a blend of computer's brute force on the corpus and careful design of distinguishing rules on the basis linguistic knowledge. As there are homograph inflections for Urdu nouns, adjectives and verbs, we compare potential inflectional forms with paradigms of inflections in strict order and gives best guess (of part of speech) for the word. We also worked on irregular plurals i.e. the plural forms that are borrowed from Arabic, Persian and English. Evaluation shows that not all the borrowed rules have same productivity in Urdu. The commonly used borrowed plural rules are shown in the result.

Tasks

Reproductions