Sense Vocabulary Compression through the Semantic Knowledge of WordNet for Neural Word Sense Disambiguation

2019-05-14 · GWC 2019 · Code Available

Loïc Vial, Benjamin Lecouteux, Didier Schwab


Abstract

In this article, we tackle the issue of the limited quantity of manually sense-annotated corpora for the task of word sense disambiguation by exploiting the semantic relationships between senses, such as synonymy, hypernymy and hyponymy, in order to compress the sense vocabulary of Princeton WordNet and thus reduce the number of different sense tags that must be observed to disambiguate all words of the lexical database. We propose two different methods that greatly reduce the size of neural WSD models, with the benefit of improving their coverage without additional training data and without impacting their precision. In addition to these methods, we present a WSD system which relies on pre-trained BERT word vectors to achieve results that significantly outperform the state of the art on all WSD evaluation tasks.
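The hypernym-based compression described above can be sketched on a toy taxonomy. This is a minimal illustration of the idea (replacing each sense tag with the most general hypernym that still separates the senses of its word), not the paper's actual implementation; the sense IDs, taxonomy, and helper names below are all hypothetical, not real WordNet data:

```python
# Hypothetical hypernym taxonomy: child sense -> parent sense.
HYPERNYM = {
    "mouse%animal": "rodent", "rodent": "mammal", "mammal": "animal",
    "mouse%device": "device", "device": "artifact",
    "bat%animal": "mammal",
    "bat%club": "stick", "stick": "artifact",
}

def ancestors(sense):
    """Chain from the sense itself up to the taxonomy root."""
    chain = [sense]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain

def compress(word_senses):
    """Map each sense of one word to its most general ancestor that is
    not on the hypernym chain of any other sense of the same word."""
    chains = {s: ancestors(s) for s in word_senses}
    mapping = {}
    for s, chain in chains.items():
        others = set()
        for t, c in chains.items():
            if t != s:
                others.update(c)
        # Walk from the most general ancestor down until the tag
        # no longer collides with the word's other senses.
        mapping[s] = next(a for a in reversed(chain) if a not in others)
    return mapping

print(compress(["mouse%animal", "mouse%device"]))
# -> {'mouse%animal': 'animal', 'mouse%device': 'artifact'}
```

Because unrelated words can share the same compressed tag (here both "mouse" and "bat" in their animal senses map to "animal"), every occurrence of such a tag in the training data becomes a training example for it, which is how the compression improves coverage without new annotations.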

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| SemEval 2007 Task 17 | SemCor+WNGC, hypernyms | F1 | 73.4 | — | Unverified |
| SemEval 2007 Task 7 | SemCor+WNGC, hypernyms | F1 | 90.4 | — | Unverified |
| SemEval 2013 Task 12 | SemCor+WNGC, hypernyms | F1 | 78.7 | — | Unverified |
| SemEval 2015 Task 13 | SemCor+WNGC, hypernyms | F1 | 82.6 | — | Unverified |
| Senseval-2 | SemCor+WNGC, hypernyms | F1 | 79.7 | — | Unverified |
| Senseval-3 Task 1 | SemCor+WNGC, hypernyms | F1 | 77.8 | — | Unverified |