UZWORDNET: A Lexical-Semantic Database for the Uzbek Language

2021-01-01 · EACL (GWC) 2021 · Code Available

Alessandro Agostini, Timur Usmanov, Ulugbek Khamdamov, Nilufar Abdurakhmonova, Mukhammadsaid Mamasaidov

Abstract

The results reported in this paper aim to increase the presence of the Uzbek language on the Internet and its usability within IT applications. We describe the initial development of UZWORDNET, a “word-net” for the Uzbek language compatible with Princeton WordNet. In its current version, UZWORDNET contains 28140 synsets, 64389 senses, and 20683 words; its estimated accuracy is 75.98%. To the best of our knowledge, it is the largest wordnet for Uzbek to date, and the second wordnet developed overall.
