SOTAVerified

Comparison of Genres in Word Sense Disambiguation using Automatically Generated Text Collections

2020-09-01CLIB 2020Unverified0· sign in to hype

Angelina Bolshina, Natalia Loukachevitch

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The best approaches in Word Sense Disambiguation (WSD) are supervised and rely on large amounts of hand-labelled data, which is not always available and costly to create. In our work we describe an approach that is used to create an automatically labelled collection based on the monosemous relatives (related unambiguous entries) for Russian. The main contribution of our work is that we extracted monosemous relatives that can be located at relatively long distances from a target ambiguous word and ranked them according to the similarity measure to the target sense. We evaluated word sense disambiguation models based on a nearest neighbour classification on BERT and ELMo embeddings and two text collections. Our work relies on the Russian wordnet RuWordNet.

Tasks

Reproductions